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The present invention relates to recombinant heterochimeric paraxnyxoviridae 
glycoproteins and their expression in eukaryotic cells, particularly in Chinese 
5 Hamster Ovary (CHO) cells. The invention further relates to methods for 
constructing and expressing such heterochimeric proteins, intermediates for use 
therein, methods to optimize the codon usage of the nucleic acid sequences which 
encode such heterochimeric proteins and the use of the recombinant proteins as 
vaccines for the prevention of diseases caused by paramyxoviridae pathogens. 

10 

The mumps (MuV), Measles (MV), the parainfluenza type I (PIV1), type II (PIV2) 
and type III (PIV3) and the respiratory syncytial (RSV) virus belong to the 
paramyxoviridae family. The MuV is classified in the rubulavirus subclass, the MV 
is classified in the Morbilli virus subclass, the parainfluenza viruses (PIV1, PIV2 
15 and PIV3) are classified in the paramyxovirus subclass while the RSV is attached to 
the pneumo virus subclass. 

RSV is the most important cause of viral lower respiratory tract disease in infants 
and children. The fusion (F) and the attachment (G) protein which are both viral 
20 surface glycoproteins appear to be of potential value for the development of a 
vaccine against RSV. 

The fusion protein F of RSV contains 574 amino acid residues; amino acids 1 to 21 
correspond to the signal peptide and residues 525 to 549 to the membrane anchor 
25 domain. The molecule presents five potential sites for glycosylation. The F protein 
is synthesized as a 70 kDa precursor (F 0 ) which undergoes proteolytic maturation to 
yield the F| subunit (48 kDa) and F 2 (23 kDa) linked via disulfide bridges. The 
protein F, when injected into animals, leads to the production of neutralizing 
antibodies and may induce cytotoxic lymphocytes (CTLs). 

30 

The attachment or G protein of RSV contains 298 amino acid residues and is 
heavily glycosylated since half of its molecular mass (90 kDa) is contributed by 
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oligosaccharide side chains, chiefly in the form of O-linked sugars. It has been 
shown that the G protein, when injected into animals, provides protection against 
homologous but not heterologous subgroup virus challenge. This protein is 
extremely variable and there is only a stretch of 13 amino acid residues which is 
5 conserved in all RSV. 

The PIV3 is second to RSV as a major agent of severe viral respiratory tract 
infections in infants. The fusion protein F of PIV3 contains 539 amino acid 
residues; amino acids 1 to 18 correspond to the signal peptide and residues 494 to 

10 516 to the membrane anchor domain. The molecule presents 4 potential sites for 
glycosylation. The F protein is synthesized as a 70 kDa precursor (F 0 ) which 
undergoes proteolytic maturation to yield the Fj (56 kDa) and F 2 (14 kDa) subunits 
linked via disulfide bridges. The protein F, when injected into animals, leads to the 
production of neutralizing antibodies. The F protein is involved in cell fusion during 

15 viral infection and carries an hemolysin activity. Used alone for immunization, the 
F protein generates an immune response which is insufficient to confer protection 
against a challenge with the vims. Complete protection is only acquired by 
concomitant immunization with the attachment protein HN, another glycoprotein of 
PIV3. 

20 

The protein HN carries hemagglutinin and neuraminidase activities. It is composed 
of 572 amino acids; its membrane anchor domain occurs in the N-terminal end of 
the molecule, between amino acid residues 32 and 53. Four potential sites for 
glycosylation have been identified. Injection of protein HN into animals generates 
25 an immune response and neutralizing antibodies. These antibodies however do not 
protect completely against a challenge with the virus. Full protection is obtained 
only by concomitant immunization with the F protein of PIV3. 

The PI VI virus was initially isolated from young children suffering from disorders 
30 of the lower respiratory tract. Infection with PIV1 causes the majority of cases of 
croup found for ail infections caused by paramyxoviruses. Viral transmission of 
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PIV1 is by person to person contact or by aerosol, although the virus does not 
persist in the environment for long. 

Like PIV2 and PIV3, the PIV1 vims has two surface glycoproteins, the fusion 
5 protein (F) and the attachment protein (HN). These two proteins are the priority 
targets for the development of a subunit vaccine, the properties of which would be 
to ensure protection of children from the very first months of life and to prevent 
reinfection, or at least to prevent the serious complications by restricting viral 
development to the upper respiratory tract where the consequences would be benign 
10 (common cold). 

PIV2 also affects very young children and causes the same type of respiratory 
discorders, essentially croup, but of less severity. The PIV2 virus has two surface 
glycoproteins (F and HN), which are potential targets for the development of a 
15 subunit vaccine. 

The measles virus is an extremely contagious agent which establishes itself in the 
epithelial cells of the respiratory tract, the oropharynx or the conjunctiva. The 
infection causes fever, cough, head-cold, conjunctivitis and a characteristic 
20 generalised rash. 

There is no appropriate inactivated vaccine against measles but an effective 
attenuated live vaccine is available and is generally used in combination with the 
attenuated live vaccines against rubella and mumps. This live vaccine protects 

25 against the disease for at least 20 years. The measles virus has two surface 

glycoproteins, which are potential targets for the development of a subunit vaccine. 
The fusion protein (F) is a 550 amino acid long glycosylated molecule and, as for 
the other paramyxovirus, has to undergo proteolitic cleavage to yield F, and F 2 
subunits that are linked via disulfide bridges. This molecule, which carries a 

30 haemolysin activity, generates an immune protective response when injected into 
animals. The attachment protein (H), is a 617 amino acid long glycosylated protein, 
which carries a hemagglutinin activity. This protein leads, when injected into 
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animals, to the production of neutralizing antibodies that are able to inhibit 
hemagglutination. This immune response protects the animal against a viral 
challenge. 

5 The mumps virus is a pathogen causing the contagious infantile illness which 
consists of the inflammation of parotid glands. During the incubation period 
following infection, the virus replicates in the respiratory epithelium then 
disseminates into secretary ducts of the parotid glands. Other glands may become 
infected thereafter and numerous cases of meningitis have been reported. Among 
10 complications related to the infection, encephalitis is a serious one, with a mortality 
rate of about 1%; deafness cases have also been reported. 

A vaccine against mumps is available: it is made of an attenuated live virus, 
produced by culturing infected embryonic chicken cells. The vaccine leads to the 
15 seroconversion in vaccinated individuals and protects against infection in more than 
95 % of seronegative persons. The vaccine thus reduces significantly the frequencies 
of complications. 

In a number of cases, however, viral infection is not detected because the effects 
20 remain subclinical. Young children and aged people are most likely to develop 

complications from mumps infection. In view of the inherent risks related to the use 
of attenuated live vaccines, such as the potentiation of the illness upon natural 
surinfection in vaccinated individuals, it is desirable to improve the safety of the 
vaccine, particularly for the groups at risk. 

25 

The fusion protein F of mumps virus contains 538 amino acid residues; amino acids 
1 to 26 correspond to the signal peptide and residues 483 to 512 to the membrane 
anchor domain. The molecule presents 7 potential sites for glycosylation. The F 
protein is synthesized as a 65-74 kDa precursor (F 0 ) which undergoes proteolytic 
30 maturation to yield the F, (58-61 kDa) and F 2 (10-16 kDa) subunits linked via 
disulfide bridges. The protein F is involved in cell fusion during viral infection, 
carries an haemolysin activity and plays a role for viral penetration into cells. It 

-4- 
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does not however carry the antibody dependent cellular cytotoxicity (ADCC) as 
observed for another mumps virus glycoprotein, HN. 

The protein HN (molecular weight 74-80 kDa) carries hemagglutinin and 
5 neuraminidase activities which are involved in virus attachment to cells and in the 
disruption of the host cell membranes. Protein HN (attachment protein or 
hemagglutinin-neuraminidase) generates neutralizing antibodies and appears 
important for the development of ADCC. Protein HN is composed of 582 amino 
acids; it carries a N-terminal anchor domain (residues 33 to 52) and 9 potential sites 
10 for glycosylation. 

For the viruses considered above, it appears that concomitant immunization with 
both membrane glycoproteins F and HN, or G in the case of RSV, are required to 
achieve full protection in the animal model. Chimeric proteins containing both the F 
15 and G proteins of RSV, or the F and HN proteins of PIV3 have shown complete 
protection against RSV or PIV3 challenge in cotton rats (Brideau et al, J Gen Virol, 
1989, 70 2637-2644 and Brideau et al, J Gen Virol, 1993, 74, 471-477). 

W093 14207 (Connaught) describes heterochiraeric proteins comprising RSV and 
20 PIV3 proteins including F(RSV)xHN(PIV3) and F(PIV3)xG(RSV) hybrids, and 
suggests that such proteins can be expressed from a variety of host cells including 
bacterial, mammalian, insect, yeast and fungal cells. The specific examples 
describe expression in insect Sf9 and High 5 cells and mammalian Vero cells. 
There is no specific disclosure of the use of CHO cells. The use of Sf9 and High 5 
25 cells is also described by Du et al, BIO/TECHNOLOGY 12,1994, 813-818. 

Homa et al (Upjohn), J Gen Virol, 1993, 74, 1995-1999 describes another 
heterochimeric protein, F(RSV)xHN(PIV3) expressed in insect cells using a 
recombinant baculovirus. 



30 



Homochimeric paramyxoviridae glycoproteins have also been described by several 
workers: - 
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WO8905823 (Upjohn) describes RSV FxG and GxF hybrids which can be expressed 
from bacterial, yeast, mammalian and insect cells. Example 7 describes the 
expression of an RSV FxG protein from CHO cells although there are no details of 
5 how successful such expression is. 

W089 10405 (Upjohn) describes PIV3 FxHN and HNxF hybrids which can be 
expressed from bacterial, yeast, mammalian and insect cells. Example 6 describes 
the expression of a PIV3 FxHN protein from CHO cells, however no details are 
10 given quantifying the extent of expression and secretion. 

Lehman et al (Upjohn), J Gen Virol, 1993, 74, 459-469 describes the expression of 
PIV3 FxHN in insect cells using recombinant baculovirus vectors as well as in CHO 
cells. 

15 

WO9306218 (SmithKline Beecham Biologicals) describes PIV3 FxHN hybrids 
which can be expressed in eukaryotic cells including vaccinia, CHO or Vera cells. 
Example B)2 describes the expression of a FsVxHNa' hybrid in CHO cells and 
indicates that the product was almost evenly distributed between cells and medium. 
20 No details are however given quantifying the extent of expression and secretion. 

WO9425600 (SmithKline Beecham Biologicals) describes MuV FxHN and HNxF 
hybrids which can be expressed in vaccinia, a mammalian cell (such as CHO) or a 
bacterial cell. Examples B) 3 and 4 describe the expression of s + FHNa'xFa' and 
25 Fs + a xHNa in CHO cells however no details are given describing the extent of 
expression and secretion. 

Although this cited art may suggest that homochimeric paramyxoviridae 
glycoproteins can be expressed in a variety of cell lines including CHO cells it has 
30 now been discovered that in fact expression and secretion from CHO cells is not 
always successful and success cannot be predicted. Thus it has now been 
demonstrated that although a RSV F x G hybrid could be successfully expressed and 
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secreted in CHO cells, analogous homochimeric hybrids from PIV3 and MuV could 
not in fact be expressed in CHO cells in such manner that they could be purified 
from the supernatant in significant quantities. 

5 Surprisingly, it has now been discovered that heterochimeric hybrids can be 
successfully expressed and secreted in both CHO and insect cells. 

Accordingly in a first aspect the present invention provides a process for preparing a 
heterochimeric protein or an immunogenic derivative thereof comprising an 
10 immunogenic fragment of the fusion (F) protein of RSV, PIV1 , PIV2, PIV3, MV 
or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MuV or MV which process comprises expressing 
recombinant DNA encoding the heterochimeric protein or immunogenic derivative 
thereof in CHO cells and recovering the protein. 

15 

By heterochimeric protein is meant one that does not contain a fusion or attachment 
protein from the same pathogen. 

This invention also provides novel heterochimeric proteins not previously described 
20 in WO 93 14207 which can be prepared using the process of the present invention. 

Thus, in a second aspect the present invention provides a heterochimeric protein or 
an immunogenic derivative thereof comprising an immunogenic fragment of the 
fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic 
25 fragment of the attachment (G, HN or H) protein of RSV, PIV1 , PIV2, PIV3, MuV 
or MV, with the proviso that where one of the immunogenic fragments is derived 
from RSV F, RSV HN or PIV3 F, PIV3 HN, the other of the immunogenic 
fragments is derived from MuV F, MuV HN, MV F, MV H, PIV1 F.PIVl HN, 
PIV2 F or PIV2 HN. 



30 



By an immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV is meant a part of the protein which contains at least one antigenic 
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determinant capable of raising an immune response specific to the F protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV respectively. Included within this definition 
is the full length F protein, preferably however the immunogenic fragment is 
lacking the membrane anchor domain at its C-terminal end. 

5 

By an immunogenic fragment of the attachment protein (G, HN or H) of RSV, 
PIV1, PIV2, PIV3, MuV or MV is meant a part of the protein which contains at 
least one antigenic determinant capable of raising an immune response specific to 
the G protein of RSV, to the HN protein of PIV1 , PIV2, PIV3, MuV or the H 
10 protein of MV respectively. Included within this definition is the full length G or 
HN protein, preferably however the immunogenic fragment is lacking the 
signal/anchor domain at its N-terminal end. 

Preferably the heterochimeric protein is linked via an amino acid in the C-terminal 
15 part of the immunogenic fragment of the F protein of RSV, PIV1 , PP/2, PIV3, 
MV or MuV to an amino acid in the N-terminal part of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1 , PIV2, PIV3, MuV or the H 
protein of MV. 

20 Suitably the heterochimeric protein commences at its N-terminal end with a signal 
sequence from the F protein of RSV, PIV1, PF/2, PIV3, MV or MuV. 
Conveniently this will be pan of the corresponding immunogenic fragment of the F 
protein of RSV, PIV1, PIV2, PIV3, MV or MuV when this fragment is linked via 
its C-terminal end to the N-terminai end of the immunogenic fragment of the G 

25 protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV or the H protein of 
MV. 

Alternative signal sequences may also be employed. For example, the 
heterochimeric protein suitably commences at its N-terminal end with a signal 
30 sequence of tissue plasminogen activator (TP A). 
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In order to enhance the level of expression the heterochimeric protein may further 
comprise a ubiquitin leader sequence which is suitably positioned after any signal 
sequence as hereinbefore described. Preferably the ubiquitin leader sequence is 
linked to the C-terminal end of the signal sequence of TPA. 

5 

Preferably the ubiquitin leader sequence is derived from yeast, for example as 
described in Ecker et al, J.Biological Chemistry, 1988, 264(13), 7715-7719, 

Suitably a cleavage site is positioned between the C-terminal end of the ubiquitin 
10 sequence and the N-terminal end of the immunogenic fragment of the F protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV. 



In order to facilitate chromatographic purification the heterochimeric protein 
suitably comprises a polyhistidine tail, for example as described in Hochuli et al, 

15 BIO/TECHNOLOGY, 1988, 1321-1325. The polyhistidine tail preferably 

comprises from 2 to 6 adjacent histidine residues which is suitably attached at the C 
terminal end of the heterochimeric protein. Preferably a cleavage site is positioned 
between the polyhistidine tail and the C-terminal end of the immunogenic fragment 
of the G protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV or the H 

20 protein of MV. 



The cleavage site for the ubiquitin sequence and/or the polyhistidine tail may be 
chemical or enzymatic and preferably is an enterokinase cleavage site, for example 
as described in LaVallie et al, BIO-TECHNOLOGY, 1993, 187-193. 

Following expression and purification, treatment with an enterokinase will cleave 
off any ubiquitin and/or polyhistidine sequence releasing the desired heterochimeric 
protein. 



30 Particular heterochimeric proteins of this invention include: 

the F protein of RSV lacking its membrane domain linked at its C-terminal end to 
the HN protein of MuV lacking its signal/anchor domain herein referred to as: 

-9- 
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Fs + a'RSVxHNsa'MuV, as well as 
Fs + a*PIV3 x HNs'a* MuV; 
Fs + a*MuVxGs*aRSV; and 
Fs + a* MuV x HNs'a*PIV3, and 
5 immunogenic derivatives thereof. 



The present invention also provides particular heterochimeric proteins which 
include: 

Fs + a*MuVxHs aMV; or 
10 Fs + aRSVxHNsa'PIVl ; or 

Fs + a*RSVxHNs*a'PIV2, and 

imunogenic derivatives thereof. 



The present invention also provides heterochimeric proteins comprising RSV and 
15 PP/3 proteins not specifically disclosed in W093 14207, which advantageously can 
be expressed from CHO cells. 
These are: 

Fs + a (1-526) RSV x HNs'a" (70-572) PIV3; 
FsV (1-492) PIV3 x Gsa' (69-298) RSV; 
20 Fs + a* (1-526) RSV x HNs'a* (70-572) PIV3 bis; 

Fs + a (1-526) RSV x HNs'a (70-572) PIV3 ent his, and 

sTPA (1-21) UB (1-74) ent Fsa (24-526) x HN s'a'(70-572) PIV3, and 

immunogenic derivatives thereof. 



25 The heterochimeric proteins of the present invention are immunogenic. The term 
immunogenic derivative as used herein encompasses any molecule which is a 
heterochimeric polypeptide which is immunologically reactive with antibodies raised 
to the heterochimeric protein of the present invention or parts thereof or with 
antibodies recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the 

30 G protein of RSV, the HN protein of PIV1, PIV2, PIV3, MuV, the H protein of 
MV, the RSV virus, the PIV1 virus, the PIV2 virus, the PIV3 virus, the MV virus 
or the MuV virus, or which, when administered to a human, elicits antibodies 
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recognising the F protein of RSV, PIV1, PIV2, PIV3, MV or MuV, the G protein 
of RSV, the HN protein of PIV1, PIV2, PIV3, MuV, the H protein of MV, the 
RSV virus, the PIV1 virus, the PIV2 vims, the PIV3 virus, the MV virus or the 
MuV virus. In particular immunogenic derivatives which are slightly longer or 

5 shorter than the heterochimeric proteins of the present invention may be used. Such 
derivatives may, for example, be prepared by substitution, addition, or 
rearrangement of amino acids or by chemical modifications thereof including the 
coupling or for enabling the coupling of the heterochimeric proteins to other carrier 
proteins such as tetanus toxoid or Hepatitis B surface antigen. All such substitutions 

10 and modifications are generally well known to those skilled in the art of peptide 
chemistry. 



Immunogenic fragments of the heterochimeric proteins which may be useful in the 
preparation of vaccines may be prepared by expression of the appropriate gene 
15 fragments or by peptide synthesis, for example using the Merrifield synthesis (The 
Peptides, Vol 2., Academic Press, New York, p3). 

In a further aspect of the invention there is provided recombinant DNA encoding 
the heterochimeric protein of the invention. The recombinant DNA of the invention 
20 may form part of a vector, for example a plasmid, especially an expression plasmid 
from which the heterochimeric protein may be expressed. Such vectors also form 
part of the invention, as do host cells into which the vectors have been introduced. 



In order to construct the DNA encoding a heterochimeric protein according to the 
25 invention, cDNA containing the coding sequences of the RSV, PIV1, PIV2, PIV3, 
MV or MuV fusion and attachment proteins and optionally of the ubiquitin, 
polyhistidine and enterokinase cleavage sites may be manipulated using standard 
techniques [see for example Maniatis T. et al Molecular Cloning, Cold Spring 
Harbor Laboratory, Cold Spring Harbor N.Y. (1982)] as further described 
30 hereinbelow. 
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In another aspect of the invention there is described a process of enhancing the 
protein expression in mammalian cells by optimization of the codon usage of the 
nucleic acids transfected therein. Optimization of the codon usage involves the 
replacement of at least one non-preferred or less preferred codon in a natural gene 
5 encoding a heterochimeric protein by a preferred codon encoding the same amino 
acid. Highly mammalian-expressed genes have C or G at their degenerative 
position (third base in the codon) whereas the RSV or PIV3-prevalent codons have 
A or T. At least one codon, and more prefereably all the codons of the RSV or 
PIV3 protein can be changed to fit at best the human usage, that is, the one (or 
10 ones) that is the most prevalent as shown below. 



Ala: GCC 


Cys: TGC 


His: CAC 


Met: ATG 


Thr: ACC 


Arg: CGC 
AGG 
CGG 


Gin: CAG 


lie: ATC 


Phe: TTC 


Trp: TGG 


Asn: AAC 


Glu: GAG 


Leu: CTG 


Pro: CCC 


Tyr: TAC 


Asp: GAC 


Gly: GGC 


Lys: AAG 


Ser: AGC 
TCC 


Val: GTG 



Each amino acid encoded by one of these codons are then considered humanised. 
The ratio between the number of humanised codons versus the total number of 
amino acids gives a percentage of humanisation as shown below. 



140/526 = 27% 

403/526 = 77% 

489/526 = 93% 

258/1029 = 25% 

528/1029 = 51% 

96% 
25 



1) F RSV (l-S26>orifinil 

20 2) F RS v<i-»M)h,iiMi>u 0 ! + (424-526)original 

3) F RSV (l-526)humaniscd 

4) F]ttV(l-S26)0!i|iml + ^ PIV3 (70-572) origtaal 

5 ) F RSV (l-lliyhunniuA + HN mj (70 _ J72) orjginl| 

6) F RSV (l'326)bununua! + HN ?m o0 . m) hulMnjK(| 
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The invention also provides DNA encoding a heterochimeric protein or 
immunogenic derivative thereof in which the codon usage of one or more nucleic 
acids has been substantially optimised and a process for expressing said DNA in a 
CHO or insect cell. 

5 

There have been a number of reports that have described a substantial amelioration 
of protein expression in mammalian cells after re-engineering the nucleic acid 
sequence of the heterologous protein to fit the codon usage found in highly 
expressed human genes (Haas J., Park E-C. and Seed B., Codon usage limitation in 
the expression of HiV-1 envelope glycoprotein, Current Biology, 1996, 6, n°3, 
315-325 ; Kim C. H., Oh Y. and Lee T.H., Codon optimization for high-level 
expression of human erythropoietin (EPO) in mammalian cells, Gene 199, 1997, 
293-301 ; Zolotukhin S., Potter M. Hauswirth W.W. Guy J. and Muzyczka N. A 
Humanized green fluorescent protein cDNA adapted for high level expression in 
15 mammalian cells. J. of Virology, July 1996, 70, n°7, 4646-4654). 

Vectors comprising such DNA, hosts transformed thereby and the truncated or 
hybrid proteins themselves, expressed as described hereinbelow all form part of the 



10 



20 



invention. 



For expression of the proteins of the invention, plasmids may be constructed which 
are suitable either for transfer into vaccinia virus or transfection into CHO cells, 
insect cells or Vera cells. Suitable expression vectors are described hereinbelow. 
Preferably the proteins of the present invention are expressed in CHO or insect 
25 cells. 



For expression in vaccinia a vaccinia transfer plasmid such as pULB 5213 which is 
a derivative of pSCll (Chakrabati et al, Molecular and Cellular Biology 5, 3403 - 
3409, 1985) may be used. In one aspect the protein may be expressed under the 
30 control of the vaccinia P7 5 promoter. 
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For expression in CHO-K1 cells a glutamine synthetase (GS) vector such as pEE14 
may suitably be used so that the protein is expressed under the control of the major 
immediate early promoter of human cytomegalovirus (hCMV-MIE). Alternatively 
a vector which allows the expression of the coding module as a polycistronic 
5 transcript with the neo selection gene may suitably be used. In one preferred aspect 
the coding module is under the control of the Rous Sarcoma Long Terminal Repeat 
(LTR) promoter. 

Preferably the plasmid for expression in CHO-K1 cells carries a GS expression 
10 cassette suitable for gene amplification using methionine sulphoximine (MSX). 
Alternatively the plasmid for expression in CHO-K1 cells carries a DHFR 
expression cassette suitable for gene amplification using methotrexate (MTX). 

Preferably expression of the heterochimeric protein of the present invention is 
15 carried out in the presence of sodium butyrate and/or dimethyl sulphoxide (DMSO) 
which may enhance gene expression. 

For expression in insect cells a shuttle vector such as pAcUW51 or pAcGP67 may 
be used. In one aspect the protein may be expressed under the control of the 
20 baculovirus plO promoter or the polyhedrin promoter. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live 
recombinant virus or bacterium. Inoculation and in vivo infection with this live vector 

25 will lead to in vivo expression of the antigen and induction of immune responses. 
Viruses and bacteria used for this purpose are for instance: poxviruses (e.g; vaccinia, 
fowlpox, canarypox), alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelian 
Equine Encephalitis Virus), adenoviruses, adeno-associated virus, picomaviruses 
(poliovirus, rhinovirus), herpesviruses (varicella zoster virus, etc), Listeria, Salmonella, 

30 Shigella, BCG. These viruses and bacteria can be virulent, or attenuated in various 
ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention. 
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5 



In yet another aspect of the invention there is provided a vaccine composition 
comprising a heterochimeric protein or immunogenic derivative thereof according to 
the invention in combination with a pharmaceutically acceptable carrier, a protein 
according to the invention for use in vaccinating a mammal and the use of a protein 
according to the invention in the preparation of a vaccine. 



Optionally, and advantageously, the vaccine of the present invention is combined 
with other immunogens to afford a polyvalent vaccine. In a preferred embodiment 
10 the heterochimeric protein is combined with other subcomponents of RSV, PIV1, 
PIV2, PIV3, MuV or MV, e.g. the single proteins F, G, HN or H or homochimeric 
proteins such as RSV FxG, PIV3 FxHN or MuV FxHN. 

In a particular aspect the invention further provides a vaccine composition 
15 comprising a protein according to the invention together with a suitable carrier or 
adjuvant. 

Vaccine preparation is generally described in New Trends and Developments in 
Vaccines, edited by Voller etal. University Park Press, Baltimore, Maryland, 
20 U.S.A., 1978. Encapsulation within liposomes is described, for example by 
Fullerton, U.S. Patent 4,235,877. 

In the vaccine of the present invention , an aqueous solution of the protein(s) can be 
used directly. Alternatively, the protein, with or without prior lyophilisation, can 

25 be mixed, absorbed or adsorbed with any of the various known adjuvants. Such 
adjuvants include, but are not limited to, aluminium hydroxide, muramyl dipeptide 
and saponins such as Quil A. Particularly preferred adjuvants are MPL 
(monophosphoryl lipid A) and 3D- MPL (3 deacylated monophosphoryl lipid A) [US 
patent 4,912,094], optionally formulated with aluminium hudroxide (EP 0 689 454) 

30 or oil in water emulsions (WO 95/17210). A further preferred adjuvant is known as 
QS21 which can be obtained by the method disclosed in US patent 5,057,540. Use 
of 3D-MPL is described by Ribi a al. in Microbiology (1986) Levie et al. feds) 
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Amer. Soc. Microbiol. Wash. D.C., 9-13. Use of Quil A is disclosed by Dalsgaard 
et a/., (1977), Acta Vet Scand, 18 9 3 49. Use of combined 3D-MPL and QS21 is 
described in WO 94/00153 (SmithKline Beecham Biologicals s.a). QS21 may be 
advantageously formulated with cholesterol containing liposomes, wherein 3D-MPL 
5 is present either in solution or incorporated in the membrane, as described in WO 
96/33739. 

As a further exemplary alternative, a heterochimeric protein of the invention or an 
immunogenic fragment thereof can be encapsulated within microparticies such as 

10 liposomes or associated with oil-in-water emulsions. Encapsulation within 
liposomes is described by Fullerton in US patent 4,235,877. In yet another 
exemplary alternative, a heterochimeric protein according to the invention or an 
immunogenic fragment thereof can be conjugated to an immunostimulating 
macromolecule, such as killed Bordetella or a tetanus toxoid. Conjugation of 

15 proteins to macromolecules is disclosed, for example by Likhite in patent 4,372,945 
and Armor et al. in US patent 4,474,757. 

The amount of the protein of the present invention present in each vaccine dose is 
selected as an amount which induces an immunoprotective response without 

20 significant, adverse side effects in typical vaccines. Such amount will vary 

depending upon which specific immunogen is employed and whether or not the 
vaccine is adjuvanted. Generally, it is expected that each dose will comprise 
l-1000ng of protein, preferably 1-200 \xg. An optimal amount for a particular 
vaccine can be ascertained by standard studies involving observation of antibody 

25 titres and other responses in subjects. 

The following examples and the attached figures (explained below) illustrate the 
invention. 

30 In the Figures: 

Figure 34 A shows the impact of humanisation on the level of expression of FrHNp, 
where: 
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FhHNElO = product expressed by the pEEl4FhHN transfected clone E10; 
FhHNE7 = product expressed by the pEE14FhHN transfected clone E7; 
FHNbis = product expressed by the pEE14FHN transfected clone; 
+but = 2mM Nabutyrate has been added to the cell medium, 3 days before 
5 harvest; 

pEE14 = negative control; 

Fdroso = pruified Fa- (drosophila derived); the standard protein in this ELISA 
assay wherein lul of standard corresponds to lng of product. 
Figure 34B shows humanisation impact on the level of expression of F^HNpivj, 
10 where the level of expression was determined by ELISA. Fdroso = purified Fa- 
(drosophila derived) that is the standard protein in this ELISA assay, lul of standard 
corresponds to lng of product. 
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EXAMPLES 

Example 1 

In order to vaccinate with a single immunogen, heterochimeric DNA molecules 
5 were constructed combining extracellular domains of the F and the attachment 
protein for each virus. DNA constructs for the PIV3 and MuV have already been 
described in WO9306218 and WO9425600, respectively. The DNA molecule 
combining the extracellular domains of the RSV F and G proteins were constructed 
as described below. 

10 

The DNA pieces were first inserted into the mammalian expression vector based on 
the replicon of the Semliki Forest Virus (pSFVl). This expression system does not 
lead to a stable expression mammalian cell line but, however gives an indication 
whether or not the chimeric protein is expressed and whether the product is 
15 effectively secreted in the culture medium, which is advantageous for the 
purification procedure. 

Stable expression in the culture medium of mammalian cell lines is preferred to 
obtain good quality and quantities of paramyxovirus glycoproteins. All the chimeric 

20 modules have been inserted in the shuttle vector, the pEE14, which integrates in the 
genome of mammalian cells such as CHO-K1. A quite good expression level was 
obtained with the RSV FxG homochimeric recombinant protein, however negligible 
expression was obtained for the FxHN recombinant homochimeric protein of either 
PIV3 or MuV. Expression of heterochimeric proteins was obtained from CHO 

25 cells. 

Thus by constructing heterochimeric DNA molecules combining the extracellular 
domains of the F protein of one virus linked to the extra cellular domain of the HN 
or G protein of another virus and inserting them into the pEE14 vector for CHO 
30 expression it has been possible to raise the expression level of these proteins. These 
proteins may be used to achieve protection against at least two paramyxoviridae 
viruses with a single immunogen. 
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Some of the chimeric molecules have been inserted into the shuttle vectors, 
pAcUWSl and pACGP67, which integrate in the genome of bacterial and 
lepidopteran cells. Surprisingly good expression of heterochimeric proteins was 
5 obtained from insect cells. 



Vector construction 
Preliminary Constructs 

10 

a) Plasmid pNTV2819 

Starting from plasmid pNIV2801, a cDNA clone encoding inter alia the F protein 
of RSV (type RSS-2; received from Dr Pringle, UK) we reconstructed a cDNA 
15 module coding for the F protein lacking the membrane anchor sequence. 

Plasmid pNIV2801 was digested with Pstl in order to recover a 1416 bp DNA pieci 
encoding amino acid residues 18 to 489 of the F protein. Synthetic 
oligonucleotides, specifying respectively the sequences for amino acids 1 to 17 and 

20 490 to 526, were used to produce the corresponding cDNA fragments by the 
polymerase chain reaction performed with pNIV2801 DNA as template. The 
primers were designed to generate also unique flanking restriction sites useful for 
subsequent cloning steps. The coding module was assembled, by ligation, from the 
three DNA pieces described above and introduced into the standard cloning vector 

15 pUC19, to create plasmid pNIV2819. This plasmid encodes the RSV F protein 
carrying its signal sequence but lacking its anchor sequence (figure 1). 

b) Plasmid pNIV 2820 

0 The cDNA module encoding the full length F protein of RSV was constructed as 
follows. Using two synthetic oligonucleotides, the polymerase chain reaction was 
performed with pNIV2801 DNA as template to generate a 273 bp DNA fragment 
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encompassing the sequence coding for aa 490 to aa 574 of the F protein, the stop 
codon and unique restriction sites useful for subsequent cloning steps. This 
fragment was digested with Nsil and EcoRl and substituted for the Nsil-EcoRl DNA 
piece present in the coding module of pNIV2819 (figure 2). The resulting plasmid, 
5 pNIV2820, thus encodes the RSV F protein carrying both signal and membrane 
anchor sequences. 



c) Plasmid pNTV2841 



10 In this construction, the DNA coding for aa 165 to 176 of the G protein of RSV is 
fused to the DNA encoding the RSV FsV protein. This part of the G protein is 
conserved among both subgroups of RSV. 

The starring material, pNIV2819, was digested by Ncol and Smal yielding a 1601 
15 bp fragment. This fragment was subcloned into the Ncol and Mscl sites of 

pNIV103 (a derivative of pULB1221, see European Patent Application No. 186643) 
leading to pNIV2844. This subcloning allowed to place the translation initiation site 
of the F protein in a more favourable context according to the model proposed by 
Kozak (Kozak M, Nature 308, 241-246, 1984). 

10 

A 1605 bp fragment was recovered from pNTV2844 by digestion with Kpnl and Sail 
and introduced by ligation into pUC19 digested with Kpnl and Sail, creating 
pNIV2840. 



Two complementary synthetic oligonucleotides specifying the sequence for amino 
acids 165 to 176 of the G protein followed by a stop codon and flanked by Mil, 
Bamtt, EcoRl and Hindlll sites were hybridized. The 55 bp resulting fragment was 
cloned into the pNIV2840 digested by Afal and Hindlll, thus replacing a 142 bp 
DNA sequence encoding amino acids 491 to 526 of the F protein. The resulting 
recombinant plasmid, pNIV2841, thus contains the sequence coding for amino acids 
1 to 490 of the F protein followed by amino acids 165 to 176 of the G protein 
(figure 3). 
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Vector Construction 

I) For transfer into the pSFVl vector 

5 a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN MuV (60-582). 

Plasmid pNIV2875, a derivative of pNIV2820 which carries the DNA coding for 
0 the F protein of RSV in which the Spel restriction site has been eliminated by site- 
directed mutagenesis into the pUC19 vector, has been digested by Hindill and 
BspHh and a 1618 bp fragment has been isolated. Plasmid pNIV3229, a derivative 
of pNIV3215 whose construction has been already described in WO9425600 and 
which carries the DNA coding for the HN protein of MuV into the pUC19 vector, 
5 has been digested with Bbsl and BamHl; a 1580 bp fragment has been isolated. 
Both fragments were linked together by two complementary synthetic BspKI-Bbsl 
oligonucleotides (Fig 4A) restoring the coding sequence of the chimeric molecule 
and were inserted into the BamRl-Hindlll site of the pUC19 vector leading to 
pNIV4102. (Fig4B) After the sequencing of the junction regions, the chimeric 
3 cassette was retrieved from pNIV4102 by a BamHl digestion and was inserted into 
the BamHl site of the pSFVl vector (Liljestrom, P. and Garoff,H. (1991) 
Bio/Technology 9, 1356). The resulting plasmid, pNIV4104, contains into the 
pSFVl vector the sequence coding for amino acids 1 to 526 of the RSV F protein 
followed by amino acids 60 to 582 of the MuV HN protein. (Fig4C) 

5 

b) The RSV fusion protein lacking the membrane anchor domain fused to the 
PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN™ (70-572). 

) Plasmid pIBI-HN , a cDNA clone containing the complete coding sequence of 
protein HN of PIV3 as well as its 3* non coding sequence (received from Dr.K. 
Dimock, University of Ottawa, Canada), has been digested by Asel and BamHl and 
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a 1468 bp fragment has been isolated. Plasmid pNTV2875 (see supra), which carries 
the DNA encoding the F protein of RSV, in which the unique Spel site has been 
eliminated by site-directed mutagenesis, inserted into the pUC19 vector, has been 
digested by BamHl and BspHl, and a 1588 bp fragment has been isolated. Both 
5 fragments were linked together by two complementary synthetic BspHl-Asel 

oligonucleotides (Fig5A) and were inserted into the BamHl site of the pUC19 vector 
leading to pNIV4105 or to pNIV4109 (FigSB) depending of the orientation of the 
chimeric module in the vector. After the sequencing of the junction region, the 
chimeric cassette was retrieved by a BamHl digestion from pNIV4109 and inserted 
10 into the BamHl site of the pSFVl vector. The resulting plasmid, pNIV4110, 

contains, inserted into the pSFVl vector, the sequence coding for amino acids 1 to 
526 of the RSV F protein followed by amino acids 70 to 572 of the PIV3 HN 
protein. (Fig5C) 

15 c) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
RSV attachment protein lacking the signal-anchor domain, F Pm (1-492) Grs V 
(69-298). 

Plasmid pNIV3310, described in WO9306218 which carries the DNA coding for 
20 amino acids 1 to 484 of the PIV3 F protein followed by amino acids 87 to 572 of 
the PIV3 HN protein into the pIBI vector, was digested by EcoBl and and a 
1435 bp fragment has been isolated. Plasmid pNIV2850, which carries the RSV G 
protein into the pUC19 vector, has been digested by Maelll and Hindlll, and a 694 
bp fragment has been isolated. Both fragments were then linked together by using 
25 two complementary Bglll-Maelll synthetic linkers (Fig6A) and were inserted into 
the EcoW-Hindlll sites of pUC19 vector leading to pNIV4103 (Fig6B). The 
chimeric module was then retrieved from the pUC19 vector by a BamHl-Hindlll 
digestion. After treating the protruding ends with the Klenow polymerase, the 
chimeric cassette has been inserted into the Smal site of pSFVl vector. The 
30 resulting plasmid pNIV4106, thus contains the sequence coding for amino acids 1 to 
492 of the F protein of PIV3 followed by amino acids 69 to 298 of the G protein of 
RSV inserted into the pSFVl vector (Fig6C). 
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d) The PIV3 fusion protein lacking the membrane anchor domain linked to the 
MuV hemagglutinm-neuraminidase lacking the signal-anchor domain, F rtV3 (1- 
493) HN MuV (60-582). 

5 

Plasmid pNIV3310 (see supra, FHN„ V3 in pIBI) was digested by EcoM and Bglll 
and a 1435 bp fragment was isolated. Plasmid pNIV3229 (see supra, HN MuV into 
pUC19) was digested by Bbsl and Hindlll, and a 1610 bp fragment was isolated. 
Both fragments were linked together by adding two synthetic complementary linkers 

10 specifying a Bglll and a Bbsl ends (Fig7A) into the pUC19 vector leading to 

pNIV4117 (Fig7B). After sequencing the junction region, the chimeric cassette was 
retrieved from the pUC19 vector by a BamUl digestion and was inserted into the 
BamHl site of the pSFVl vector. The resulting plasmid pNIV4118 encodes, cloned 
in the pSFVl vector, the DNA sequence specifying amino acids 1 to 493 of the 

15 PIV3 fusion protein linked to amino acids 60 to 582 of the MuV HN protein 
(Fig7C). 



e) The MuV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, F MllV (1-482) 
20 (69-298). 



Plasmid pNIV3221, described in WO9425600 which carries the sequence encoding 
amino acids 1 to 462 of the MuV fusion protein within the pUC19 vector, has been 
digested with £coRI and BsiFl, and a 771 bp fragment has been purified. Plasmid 
pNIV3221 has been also digested with flrrFI and Pstl, and a 628 bp fragment has 
been isolated. Plasmid pNIV2850 (see supra, G MV into the pUC19) has been 
digested with Mae/// and Hind/// and a 694 bp fragment has been isolated. The 
three fragments were linked together; the F mw /C KV junction was created by adding 
to the ligation reaction two synthetic complementary oligonucleotide specifying Pstl 
and Maelll sites (Fig8A), and were inserted into the EcdSl-Hindlll sites of the 
pBluescript vector leading to pNIV4H3(Fig8B). The chimeric cassette was 
recovered from pNIV4113 by a Asp718l digestion and, after treating the protruding 
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ends with the Klenow polymerase, was inserted into the Smal site of the pSFVl 
vector. The resulting plasmid, pNIV4114 contains into the pSFVl vector the 
sequence specifying amino acids 1 to 482 of the MuV F protein linked to amino 
acids 69 to 298 of the RSV G protein (Fig8C). 

5 

0 The MuV fusion protein lacking its membrane anchor domain linked to the 
PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F MuV (1- 
482) HN PrV3 (54-572). 

10 Plasmid pNIV4113 (see supra, F MuV x in pBluescript) was digested by Bsal and 
BamHl, a 1469 bp fragment was isolated. Plasmid pNIV3308, described in 
WO9306218 and which carries the DNA sequence specifying amino acids 1 to 31 
followed by amino acids 54 to 572 of the PIV3 HN protein into the pIBI vector, 
was digested by EcoRl and BamHl and a 1569 bp fragment was isolated. Both 

15 fragments were linked together by two synthetic complementary linkers specifying 
Bsal and EcoRl sites (Fig9A) into the BamHl site of pBluescript leading to 
pNIV4115 (Fig9B). The chimeric module was recovered from pNIV4115 by a 
BamHl digestion and was inserted into BamHl site of pSFVl vector. The resulting 
plasmid, pNIV4116, encodes, in the pSFVl vector, the sequence specifying amino 

20 acids 1-482 of the MuV F protein fused to amino acids 54 to 572 of the PIV3 HN 
protein (Fig9C). 

g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, (1-526) Grsv(69- 
25 298). 

Plasmid pNIV2857 (Figl6A), a derivative of pNIV2841 and which contains the 
DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
30 and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted in the Smal 
site of the pSFVl vector. The resulting plasmid pNIV2870, contains in the pSFVl 
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vector, the DNA sequence coding for amino acids I to 526 of the RSV fusion 
protein linked to amino acids 69 to 298 of the RSV attachment protein (Figl6B). 

II) For transfection into CHO cells 

5 

a) The RSV fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, (1- 
526) HN MuV (60-582). 

10 Plasmid pNIV4102, (FiglOA, see supra, Frsv x HN MuV into the pUC19 vector) has 
been digested with Bamffl, and after treating the protruding ends with the Klenow 
polymerase, the chimeric module has been inserted into the Smal site of the 
glutamine synthetase (GS) vector, pEE14 (Coctcett et al y 1990, Bio/Technology 8, 
662-667). The resulting plasmid pEE14 FsV RSV x HN s'a' MuV contains 

15 sequences coding for amino acids 1 to 526 of the RSV F protein fused to amino 
acids 60 to 582 of the MuV HN protein under the control of the major immediate 
early promoter of the human cytomegalovirus (hCMV-MIE) (FiglOB). 

b) The RSV fusion protein lacking its membrane anchor domain linked to the 
20 PIV3 hemagglutinin-neuraminidase lacking its signal-anchor domain, F^ (1- 

526) HNpjv 3 (70-572). 

Plasmids pNIV4105 and pNIV4109 (FigllA and B, see supra, F^ x HN PJV3 into 
the pUC19 vector) were digested by EcoKL and Xhol and a 2032 bp as well as a 
25 1064 bp fragments were isolated. Both fragments were inserted together into the 
EcoKI site of pEE14. The resulting plasmid pEE14 Fs + a* RSV x HNs a* PIV3 
contains sequences coding for amino acids 1 to 526 of the RSV F protein fused to 
amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (FigllC). 

30 
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c) The PIV3 fusion protein lacking the membrane anchor region linked to the 
RSV attachment protein lacking the signal-anchor domain, F Pm (1-492) 
(69-298). 



5 Plasmid pNIV4103 (Figl2A, see supra, F PTO x Grsv into the pUC19 vector) was 
digested by Hindlll and a 2180 bp fragment was isolated. After treating the 
protruding extremities with the Klenow polymerase, the chimeric module was 
inserted into the Smal site of the pEE14 vector. The resulting plasmid, pEE14 Fs 
PIV3 x GsaRSV, contains, under the control of the hCMV promoter, the sequence 
10 encoding amino acids 1 to 492 of the PIV3 F protein followed by amino acids 69 to 
298 of the RSV G protein (Fig 12B). 

d) The PIV3 fusion protein lacking the membrane anchor domain fused to the 
MuV hemagglutinin-neuraminidase lacking the signal-anchor domain, F rm (1- 
15 493) HN MuV (60-582). 



20 



was 



Plasmid pNIV4117 (Figl3A, see supra, F P1V3 HN MuV into the pUC19 vector) 
digested with Hindm and a 3119 bp fragment was isolated and inserted into the 
Hindlll site of the pEE14 vector. The resulting plasmid, pEE14 FsV PIV3 x HNs' 
a MuV, contains under the control of the hCMV promoter a sequence encoding 
amino acids 1 to 493 of the PIV3 fusion protein fused to amino acids 60 to 582 of 
the MuV HN protein (Figl3B). 



e) The MuV fusion protein lacking its membrane anchor domain fused to the 
25 RSV attachment protein lacking its signal-anchor domain, F MuV (1-482) 
(69-298). 

Plasmid pNIV4113 (Figl4A, see supra, F MuV G m into the pBluescript vector) has 
been digested Asp718l, the protruding ends have been treated by the Klenow 
30 polymerase. A 2200 bp fragment has been isolated and inserted into the Smal site of 
pEE14. The resulting plasmid, pEE14 Fs'a' MuV x GsaRSV, has, under the 
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control of the hCMV promoter, the sequence encoding amino acids 1 to 482 of the 
MuV F protein followed by amino acids 69 to 298 of the RSV G protein (FigHB). 

f) The MuV fusion protein lacking its membrane anchor domain fused to the 
i PIV3 hemagglutmin-neuraminidase lacking its signal-anchor domain, F MuV (1- 
482) HN Pm (54-572). 

Plasmid pNIV4115 (Figl5A, see supra, F MuV x HN PIV3 into the pBluescript vector) 
has been digested with EcoRl and a 3040 bp fragment has been inserted into the 
EcoM site of the pEE14 vector. The resulting plasmid, pEE14 FsVMuV x HNs*a' 
PIV3, contains, downstream to the hCMV promoter region, a sequence coding for 
amino acids 1 to 482 of the MuV F protein followed by amino acids 54 to 572 of 
the PIV3 HN protein (Figl5B). 



15 g) The RSV fusion protein lacking its membrane anchor domain linked to the 
RSV attachment protein lacking its signal-anchor domain, (1-526) G^o^- 
298). 

Plasmid pNIV2857 (Figl7A), a derivative of pNIV2841 and which contains the 
20 DNA sequence coding for amino acids 1 to 526 of the RSV fusion protein linked to 
amino acids 69 to 298 of the RSV attachment protein, has been digested by Asp718I 
and Hindlll and a 2180 bp fragment has been isolated. After treating the protruding 
extremities with Klenow's polymerase, this fragment has been inserted the Smal site 
of the pEE14vector. The resulting plasmid, pEE14 Fs + a RSV x Gsa" RSV, contains 
under the control of the hCMV promoter the DNA sequence coding for amino acids 
1 to 526 of the RSV fusion protein linked to amino acids 69 to 298 of the RSV 
attachment protein (Figl7B). 

h) The original RSV fusion protein lacking the membrane anchor domain 
30 linked to the PIV3 hemagglutm-neuraininidase lacking the signal-anchor 
domain, F^y (1-526) HN P1VJ (70-572) bis. 



25 
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Plasmid pNIV2852, a derivative of pNIV2820 which carries the DNA encoding the 
RSV F protein where the translation initiation site is in a more favourable context 
according to the model proposed by Kozak (Kozak M, Nature 308, 241-246, 1984), 
has been digested BamHI and BspHI, and a 1588 bp fragment has been isolated. 

5 

Plasmid pIBI-HN, a cDNA clone containing the complete coding sequence of the 
HN protein of PIV3 (received from Dr. K. Dimock, University of Ottawa, Canada) 
has been digested by Asel and BamHI and a 1468 bp has been isolated. 

10 Both fragments were linked together by two complementary synthetic BspHI-Asel 
adaptators (Figl8A) and were inserted into the BamHI site of the pUC19 vector 
leading to pNIV4120 (Figl8B). 

After the sequencing of the junction region, the chimeric cassette was retrieved by a 
15 BamHI digestion from pNIV4120 and inserted into the BamHI compatible Bell site 
of the pEE14 vector. The resulting plasmid pEE14 FsVRSV x HNsa* PIV3 bis 
contains the sequences coding for amino acids 1 to 526 of the RSV F protein fused 
to amino acids 70 to 572 of the PIV3 HN protein under the control of the hCMV 
promoter (Figl8C). 

20 

This construct differs from the earlier pEE14 Fs + a'RSV x HNs a" PIV3 construct 
(II-a) in the F coding region. In F^Ny^ bis, the nucleic acid sequence found in 
F^yHNp,^, ATG GAT CTG (those codons are specifying aa Metl, Asp2 and Leu3) 
and ACC AGT (specifying aa Thr54 and Ser 55) is replaced by the original 
25 sequence of the RSV F protein that is ATG GAG TTG (specifiyng aa Metl, Glu2, 
Leu3) and ACT AGT (specifying Thr54 and Ser55). 

i) The original RSV fusion protein lacking the membrane anchor domain linked 
to the PIV3 hemagglutinin-neuraminidase lacking the signal-anchor domain 
30 with, at the C-terminal part, a polyhistidine tail preceded by the enterokinase 
cleavage site, (1-526) HNp^ (70-572) en his 
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Plasmid pIBI-HN, a cDNA containing the PIV3 HN protein coding sequence (see 
supra) has been digested by PstI and SphL A 4588 bp fragment has been isolated 
and linked to complementary synthetic Pstl-SphI adaptators (Figl9A). 

5 After the sequencing of the junctions as well as the synthetic linkers, the resulting 
plasmid pNIV3340 has been digested by Xhol and BamHI and a 1 121 bp fragment 
has been isolated (Figl9B). 

Plasmid pNIV4120 (see supra) has been digested by Xhol and BamHI and a 2017 bp 
10 fragment has been isolated (Figl9C). 

Both fragments were linked together and inserted into the BamHI compatible Bell 
site of the pEE14 vector. The resulting plasmid pEE14 FRSVsV x HNs'a" en his 
contains, under the control of the hCMV promoter, sequences coding for amino 
15 acids 1 to 526 of the RSV fusion protein fused to the amino acids 70-572 of the 
PIV3 HN protein fused to the enterokinase cleavage site,({Asp} x4 Lys) followed 
by a polyhistidine tail ({his}x6) and a stop codon (Figl9D). 

20 j) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the 
original RSV fusion protein lacking its membrane signal and anchor domains 
linked to the PIV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, sTPA(l-21) UB(l-74) ent (24-526) HN^ (70-572)bis. 

25 

1) The signal domain of the tissue plasminogen activator fused to the yeast 
ubiquitin. 

A 208 bp fragment corresponding to amino acid 1 to 76 of the ubiquitin protein of 
30 Saccharomyces cerevisiae was isolated by a digestion of pNIV3475 ( a derivative of 
YEPUBSTUALL. a yeast 2 \i vector backbone carrying the yeast ubiquitin) with 
BamHI and Xbal (Fig 20 A). 



-29- 



WO 00/18929 



PCT/EP99/07004 



Plasmid JW4304 (received from J. Mullins, University of Washington, U.S. A) 
which encodes the signal domain of the tissue plasminogen activator (sTPA) was 
digested by Nhel and BamHl and a 51 15bp was isolated. Both fragments were 
5 linked together using two synthetic complementary Nhel-Xbal adaptators (Fig20B). 
The resulting plasmid pNIV4121 was digested by Hindffl and BamHI. A 330 bp 
fragment was isolated and inserted into the Hindffl and BamHI sites of the 
pBluescript vector. The resulting plasmid pNIV4122 contains the DNA sequence 
specifying the signal domain of the tissue plasminogen activator followed by an 
10 alanine and a serine residue (those two amino acids are known to produce a good 
leader cleavage) fused to the yeast ubiquitin (Fig 20C). 

2) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and amino acid 
15 24 to 55 of the original fusion protein of RSV. 

Plasmid pNIV4122 (Fig 21A, see supra) was digested by Aflll and Spel. A 3212 bp 
fragment was isolated and linked to synthetic complementary Aflll-Spel adaptators 
(Fig21B). The entire module was then sequenced. The resulting plasmid pNIV4123 
20 encodes the signal domain of the tissue plasminogen activator linked to the N- 

terminal 74 aa of the yeast ubiquitin followed by the recognition site of enterokinase 
{(Asp)4 Lys} and amino acid 24 to 55 of the original fusion protein of RSV 
(Fig21C). 



25 3) The signal domain of the tissue plasminogen activator linked to the yeast 
ubiquitin followed by the enterokinase cleavage recognition site and the RSV 
fusion protein linked to the PIV3 hemagglutin-neuraminidase lacking their 
membrane domains. 



30 Plasmid pNIV4123 (Fig 22A, see supra) was digested by Hindffl, treated by the 
Klenow polymerase and digested by SpeL A 408 bp fragment has been isolated. 
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Plasmid pNIV4120 (Fig 22B, see supra) has been digested by Xbal, treated by the 
Klenow polymerase, and digested by SpeL A 5620 bp fragment has been isolated. 

Both fragment have been linked together to generate pNIV4124 (Fig 22C). 

5 

The entire coding module was retrieved from pNTV4124 by a digestion with Xbal 
and EcoRI and was inserted into the Xbal and EcoRI sites of the pEE14 expression 
vector. The resulting plasmid pEE14 sTPA x UBI x EN x Fs aRSV x HNsa'PP/3, 
contains, under the control of the hCMV promoter, the sequence coding for aal-21 
10 of the tissue plasminogen activator followed by an alanine and a serine residue, by 
the 74 N-terminal amino acids of the yeast ubiquitin, by the recognition cleavage 
site of the enterokinase ({Asp}4 Lys), by aa 24-526 of the original RSV fusion 
protein and by aa 70-572 of the hemagglutin-neuraminidase of PIV3. 

15 HI) For transfection into Insect Cells 

a) The original RSV fusion protein lacking the membrane anchor domain 
linked to the PiV3 hemagglutin-neuraminidase lacking the signal-anchor 
domain, (1-526) HNpi V3 (70-572) bis. 

20 

Plasmid pNIV4120 (FIG 23A) was digested by BamHImd a 3114 bp fragment was 
isolated and inserted into the BamHI site of the baculovirus transfer vector, 
pAcUWSl (PharMingen). The resulting plasmid pNIV4132 (Fig 23B) contains, 
under the control of the polyhedrin promoter, the sequence coding for amino acids 
25 1-526 of the RSV F protein fused to amino acids 70-572 of the PiV3 HN protein. 

b) The baculovirus gp67 signal peptide fused to the original RSV fusion protein 
lacking both membrane signal and anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, SGP67FRSV (25- 

30 526) HN PiV3 (70-572) bis. 
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Plasmid pNIV4120 (FIG 24A, see supra) was digested by BamHl and Spel and a 
2939 bp fragment was isolated, linked to two complementary synthetic BamHI-Spel 
adaptators and inserted into the BamHI site of the baculovims transfer vector, 
pAcGP67A (PharMingen). The resulting plasmid pNIV4136 (Fig 24) contains, 
5 under the control of the polyhedrin promoter, the sequence coding for amino acids 
1-38 of the Baculovirus gp67 protein, followed by an Alanine and an Aspartate 
linked to amino acids 25-526 of the RSV F protein fused to amino acids 70-572 of 
the PiV3 HN protein. 

10 Expression in eukaryotic cells 

A) via the pSFVl vector 

The pSFVl vector is based on the Semliki Forest Virus (SFV) replicon. The DNA 
15 of interest is cloned into the pSFVl vector that serves as a template for in vitro 
synthesis of recombinant RNA. The RNA is transfected into mammalian cells such 
as BHK-21 cells. The recombinant RNA in the cells drives its own replication and 
capping resulting in production of heterologous protein. 

20 Plasmids pNIV2870 was digested with Pvul; pNIV4106, pNIV41 10, pNIV41 14, 
pNIV41 16 and pNIV41 18 were digested with Spel prior to RNA transcription. 
After a phenol extraction followed by an ethanol precipitation, 2 of linearized 
DNA was used as a template for RNA production. About 5 jig RNA was used to 
transfect, by electroporation, about 8 10 6 BHK-21 cells. All experimental 

25 procedures for RNA production and cell transfection are detailed in Liljestrom and 
Garoff (Bio/Technology, 1991, 9, 1356). 

After 24 h to 48 h post-electroporation, cells and spent culture medium have been 
collected for ELISA and radioimmunoprecipitation assays. 
30 a) pNIV4104, F w HN MuV 
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ELISA were done using mAb 2072 anti-HN MuV (Orvell, 1984, /. Immunology 
132, 2622-2629) or 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to coat 
the microliter plates and a rabbit polyclonal anti-SBL-1 (MuV) serum or mAbl9 
anti-F RSV (G.Taylor, Inst, of Animal Health, Compton Lab., U.K.) as capture 
5 antibody. 

Radioimmunoprecipitation of the 35 S-methionine labelled product was done using 
mAb2072 (Orvell) and products were resolved onto 7.5% SDS-PAGE. 

10 b) pNIV4110, HNpro 

ELISA were done using anti-RSV goat serum 20RG45 or mAb anti-HN^ 4830 
(Rydbeck et aU J. Gen. Virol 67 y 1531-1542, 1986) to coat microliter plates and 
mAbl9 anti-F RSV (G.Taylor) or rabbit anti-PIV3 (E.Norrby, Stockholm) serum as 
15 a capture antibody. 

Radioimmunoprecipitation was done using anti-HN PIV3 mAb4830. 

c) pNIV4106, F PIV 3 

20 

ELISA were done using mAb anti-F PIV3 4549 (E.Norrby, Stockholm) or mAb ami 
Grsv 858-2 (Chemicon, U.S.A.) to coat microtiter plates and a rabbit anti-PIV3 
serum as a capture antibody. 

25 Radioimmunoprecipitation was done using mAb anti-F PIV3 3283 (Behringwerke). 

d) pNIV4118, F prv3 HN MuV 

ELISA plates were coated with anti-F PIV3 mAb 1031215 (Norrby) or with mAb 
30 2072 anti-HN MuV (Orvel) and rabbit anti-PIV3 sera or rabbit anti-MuV sera were 
used as capture antibody. 

Immunoprecipitation of labelled product was done using mAb 2072 anti-HN MuV. - 
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e) pNIV4114, F MuV x 

ELISA plates were coated with anti-F MuV monoclonal 5414 (Orvell) or anti 
5 mAb (Chemicon) and a rabbit anti-SBL-1 serum was used as a capture antibody. 

0 pNIV4116, F MuV x HN PIV 3 

ELISA plates were coated with anti-F MuV mAb 5414 (Orvell) or mAb anti-HN 
10 PIV3 4830 (Norrby) and rabbit anti-SBL-1 serum or a rabbit anti-PIV3 serum as a 
capture antibody. 

g) pNIV2870, F^x Grsv 

15 ELISA were done using 20RG45, a goat anti-RSV serum (Fitzgerald, U.S.A.) to 
coat the microtiter plates and mAbl9 anti-F RSV (G.Taylor, Inst, of Animal 
Health, Compton Lab., U.K.) as capture antibody. 



20 B) Expression in CHO cells (stable transformants) 

All recombinant plasmids were transfected by calcium phosphate coprecipitation 
into CHO-KI cells, using 20 jug DNA per 1.25 10 6 cells. The CHO-KI cells were 
grown in GMEM-S medium. The GS transfectants were selected by adding 25 

25 methionine sulfoximine to the culture medium two days after transfection. After ten 
to fourteen days, resistant colonies were picked and transferred into 96 wells plates. 
Each transformant was then transferred into 24 wells plates and subsequently to 80 
cm 2 flasks. The GS transformants were assayed for the recombinant products when 
cells reached about 80% confluency. The procedure follows the one described in 

30 Cockett et al (Bio/Technology, 1990, 8, 662-667). 
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EUSA and immunoprecipitation of radiolabelled products were done using the same 
procedures as the ones described above for the pSFVl system. 



Results 
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Expression in Insect cells 

10 a) Expression in lepidopteran cells. 

The vector pAcUWSl is a shuttle vector for bacteria and lepidopteran cells. A 
heterologous protein coding sequence can be inserted downstream the baculovirus 
plO promoter or either downstream the polyhedrin promoter. 

15 

The pAcGP67 vector is a shutde vector for bacteria and lepidopteran cells that 
contains the gp67 signal sequence upstream a multiple cloning site. A heterologous 
gene can be inserted in one of the cloning site and will be expressed as a gp67 
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signal peptide fusion protein under the control of the polyhedrin promoter. The 
gp67 signal peptide mediates the secretion of the recombinant protein. 

Either P AcUW51 or P AcGP67 recombinant plasmid can be transfected along with 
5 baculovinis linearised DNA into Sf9 cells (Baculogold DNA, PharMingen). This 
leads to the generation of a recombinant baculovinis stock. The expression of the 
recombinant heterologous protein is obtained by infecting insect cells with the 
recombinant baculovinis 

10 Plasmid pNIV4132 or plasmid pNIV4136 were transfected with baculovinis linearised 
DNA into Sf9 cells. Recombinant baculovinis 3546 (derived from cells transfected by 
PNIV4132) or 5V (derived from cells transfected by pNIV4136) were plaque purified 
and were used to infect Sf9 or High Five™ cells (Invitrogen). 24h to 72 h post- 
infection the cells and the spent culture medium have been collected for ELISA and 

15 Western blot analysis. 

ELISA were done using anti-RSV goat serum 20RG45 (Fizgerald) to coat microtiter 
plates and mAbl9 anti-F RSV (G.Taylor) as a capture antibody. 

20 Western blots were done using mAbl9 anti-F RSV (G.Taylor) or using anti-RSV 
goat polyclonal serum 20RG45 (Fizgerald). 

The spent medium from cells infected by either baculovinis 3546 or by 5V tested 
positive in ELISA. The level of expression, depending on the host cell line (SF9 or 
High Five), multiplicity of infection, medium (fetal calf serum supplemented or 
serum free synthetic medium) was at least ten times higher than the one obtained 
with a recombinant CHO-KI clone obtained by transfection with pEE14 F^ (1- 
526) HNp 1V 3 (70-572)bis . 

30 In addition, the spent medium of the baculovinis infected cells reacted positively in 
Western blot. A band in the vicinity of 1 lOkDa was present in the immunoblots. These 



25 
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results confirm the secretion of the chimeric F RSV -HNpiV3 into the medium of Sf9 and 
High Five ceils infected with the recombinant baculoviruses. 

b) Purification or the recombinant product 

5 

SF9 cells, adapted to serum free medium, were infected with the plaque purified 
recombinant baculovirus V5 or 3546. The cells were grown in suspension in 500ml 
Erlenmeyer flask in SF900II medium (Gibco BRL). The medium from virus infected 
cells were harvested two days post-infection. The soluble Frsv-HNrvs product was 

10 purified from the medium of infected cells by immunoaffinity chromatography using an 
anti-F RSV monoclonal antibody, mAbl9. The anti-F monoclonal antibody was 
coupled to Activated CH Sepharose 4B (Pharmacia) following the manufacturer 
instructions. The immunoaffinity gel was washed 3 times with 10 bed volumes of 
buffer A (20mM phosphate buffer pH 6.4, NaCl 150mM) prior to sample loading. 

15 After 16 hours at 4°C, the gel was washed with buffer A and the chimeric product was 
eluted with lOOmM phosphoric acid. Eluted protein was neutralized immediately with 
one tenth of volume of 1M phosphate buffer pH 7. 

SDS-PAGE of the immunoaffinity-purified F RSV -HNp iV3 revealed the presence of a 
20 major protein band of about 110 kDa. This protein was visualized by Coomassie 
blue staining of the gel and reacted with the monoclonal antibody anti-F^y (mAbl9) 
or with the polyclonal serum (20RG45) on immunoblots (Fig25). 

c) production of polyclonal antibodies 

25 

In order to obtain specific antibodies, the baculovirus derived FRsv-HN P iv3 protein, 
purified by immunoaffinity as described above, was used to immunise four BalbC mice 
and two New Zealand white rabbits. Three sub-cutaneous injections of 
20ng/ml/dose/rabbit or 6(ig/100^tl/dose/mouse were done at three weeks interval. The 
30 sera were collected 3 weeks after the second and the third injection and the antibody 
response was detected using ELISA and Western blots assays. 
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1) ELISA assays 

a) Mice response 

The antibody response was followed using a goat anti-RSV serum (20RG45, 
Fitzgerald, USA) to coat the microtiter plates and mouse anti-FHN sera as capture 
5 antibody. The antigens used were either the Faasv-Drosophila or CHO derived, the 
Frsv-HNp^ expressed in baculovirus and the medium of CHO cells transfected by 
the pEE14 was used as a negative control. 

3 our of 4 mice sera collected after the second injection showed some but low 
10 specific response. However, the mice sera collected after the third injection showed 
a high increase in level of specific antibodies. 

b) Rabbit response 

The antibody response was followed using either one of the following ELISA. The 
15 antigens were the same as the one used to detect the mice antibody response. 

Either a goat anti-RSV serum (20RG45, Fitzgerald, USA), either a monoclonal 
antibody directed against the RSV fusion protein (mAbl9, Compton Lab, UK) or a 
monoclonal antobody directed against the PiV3 hemagglutinin-neuraminidase 
(mAb3285, Behring) were used to coat the microtiter plate and the rabbit anti-HN sera 
20 was used as a capture antibody. The first and the second test bleeds generated high 
specific antibodies. 

2) Western blot assays 

Recombinant Fa-RSV CHO-KI ou Drosophila derived, Frsv-HNpivs baculovirus 
25 derived or the CHO-pEE14 spent medium culture were electrophoresed onto a 15% 
SDS-PAGE and transferred onto a nitrocellulose membrane (Amersham). The rabbit 
anti-HN sera as well as the mouse anti-HN sera detected specifically either the F 
protein or the F RS v-HNpjV3 chimera. 
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Example 2 

i) Optimization of the codon usage of the nucleic acids sequence coding for the 
RSV fusion protein lacking the membrane anchor domain linked to the PiV3 
hemagglutin-neuraminidase lacking the signal-anchor domain, (1-526) 
5 HN Pi v3 (70-572) for the expression in mammalian cells. 

A table showing the comparison of the codon usage found in the FrsvHNpsvj module 
with the one found in highly expressed human gene can be found in Fig.26. As 
noted, the most prevalent codons found in the Fr^N^ module have an A or a T 
10 at their third degenerative position, whereas the human prevalent codons have a C 
or a G. For the improvement of the FRsyHNp^ protein expression, the entire coding 
sequence has been re-engineered to fit at best the human codon usage. The re- 
engineered sequence was obtained using synthetic long oligonucleotides, polymerase 
chain reaction (PCR) and conventional cloning procedures. 

15 

Re-engineering of the coding sequence of the Fr^EN^ module 
The entire synthetic sequence was recovered by joining three PCR fragments (A, B 
and C). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 27. It consists of assembling overlapping long oligonucleotides in 
20 a first round amplification. The resulting full size fragment is further amplified 
using two short primers located on each of its extremities. 



Construction of fragment A 

The first PCR fragment, corresponding to 18 bases encoding restriction sites 
25 followed by bases 1 to 1269 of the FrsvHNp,^ followed by 8 bases encoding 
restriction sites, was obtained by PCR assembly of 18 overlapping oligonucleotides 
(Fig 28). This fragment has been inserted in the pCRIITOPO cloning vector 
(Invitrogen). After sequencing the fragment, it was retrieved from the pCRIITOPO 
vector by a Xbal and BsrGI digestion and inserted into the corresponding sites of 
30 pNIV4120. The module corresponding to Fr^N^ with bases 1 to 1264 
humanized was then retrieved by an Xbal and EcoRJ digestion and inserted into the 
corresponding sites of pEE14 (Fig.29) generating pEE 1 4XFR5 v humHN PjV3 . 
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Construction of fragment B 

The second PCR fragment B corresponding to 13 bases encoding unique restriction 
sites followed by bases 1264 to 2136 of F w HN„v3 was obtained by assembling 10 
oligonucleotides whose sequences can be found in Fig.30. This fragment has been 
5 inserted in the pCRIITOPO vector and sequenced. This fragment has been 
recovered by a BsrGI and Kpnl digestion. 

Construction of fragment C 

The third PCR fragment corresponding to bases 2023 to 3090 followed by 6 extra 
10 bases encoding an EcoRI site has been assembled starting from the 15 
oligonucleotides shown in Fig 31. This fragment has been inserted in the 
pCRIITOPO cloning vector and sequenced. This fragment has been retrieved by a 
Kpnl and EcoRI digestion (Fig 31). 

15 Construction of the entire coding sequence 

The entire F RSV HN PiV3 codon optimized coding sequence has been obtained by 
assembling fragment A, B, C as shown in Fig.32. pNIV4120 in which the PCR 
fragment A has replaced the original sequence (see Fig.29) was digested by BsrGI 
and EcoRI. The original sequence was eliminated and replaced by the BsrGI- Kpnl 

20 fragment B and the Kpnl-EcoRI fragment C. The codon optimized module was 
retrieved from the PCRIITOPO vector by a Xbal and an EcoRI and inserted in the 
corresponding sites of the pEE14 vector. The resulting plasmid, pEERF,^ 
humHNp iV3 hum, encodes for the entire humanized coding sequence. The humanized 
pRsvHNpivs nucleic acids sequence is shown in Fig. 33. 

25 

Expression in CHO-KI cells 

The recombinant pEE14 F^ humHN PjV 3 (see construction of fragment A, above, or 

recombinant pEEMF^v humHN PiV3 hum see construction of the entire coding 

30 sequence, above) was transfected using the FuGene reagent (Boeringer Mannheim), 

using 5 M g DNA per 1.25 10 6 cells. The CHO-KI cells were grown in GMEM-S 

medium. The GS transfectants were selected by adding 25 /xM methionine 

sulfoximine to the culture medium two days after transfection. After ten to fourteen 
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days, resistant colonies were picked and transferred into 96 wells plates. Each 
transformant was then transferred into 24 wells plates and subsequently to 80 cm 2 
flasks. The GS transformants were assayed for the recombinant product when cells 
reached about 80% confluency. The procedure follows the one described in Cockett 
5 et al (Bio/Technoiogy, 1990, S, 662-667). Alternatively, the expression was 

evaluated three to five days after the addition of sodium butyrate (2mM) in the cell 
culture. 

To compare the expression level to that of the non humanized F^yHNpiva, ELISA 
10 assays were done, using 20RG45, a goat anti-RSV serum (Fizgerald, U.S.A.) to 
coat the microliter plates and mAbl9 anti-F RSV (G. Taylor, Inst, of Animal 
Health, Compton Lab, U.K.) as capture antibody. The expression level was 
estimated using a purified Fa-^ expressed in the Drosophila system. 

15 The level of expression of the non-humanized expressed product by 

pEE14F RSV HN PiV3 didn't exceed 0.03 mg/L and 0.1 mg/L when sodium butyrate 
was added to the culture medium. The level of expression of the partially 
humanized product expressed by pEEHF^vhurnHNp^, reached 1 mg/L and up to 
3 mg/L when sodium butyrate was added in the culture medium. The humanization 

20 of the sequence coding for amino acids 1-423 of the 1029 amino acids thus 
enhanced the level of expression up to 30 fold (see Figure 34a). 

The level of expression of the entirely humanized product expressed by pEE14F MV 
humHN PiV3 hum was at least of 2 mg/L and reached up to 50 mg/L when sodium 
25 butyrate was added in the culture medium. The humanization of the entire coding 
region of F RSV HNp j v 3 thus enhanced the level of expression of at least 200 to 500 
fold (see Figure 34b). 

ii) Optimization of the codon usage of the nucleic acids sequence coding for the 
30 mumps virus (MuV) fusion protein lacking the membrane anchor domain 
linked to the measles virus (MV) lacking the signal-anchor domain, F Muv (1-482) 
H Mv (59-617) for the expression in mammalian cells. 
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A table showing the comparison of the codon usage found in the FmuvHmv module with 
the one found in highly expressed human gene can be found in Fig.35. As it can be 
seen, the codon usage frequencies of this chimerical gene is quite different from those 
5 prevalent in the human genome. For the improvement of the FmuvHmv protein 
expression, the entire coding sequence has been re-engineered to fit at best the human 
codon usage. The re-engineered sequence was obtained using synthetic long 
oligonucleotides, polymerase chain reaction (PCR) and conventional cloning 
procedures. 

10 

Re-engineering of the coding sequence of the FmuvHmv module 

The entire synthetic sequence was recovered by joining four PCR fragments 
(A, B, C and D). The general strategy to obtain each PCR fragment is schematically 
represented in Fig 36. It consists of assembling overlapping long oligonucleotides in a 
15 first round amplification. The resulting full size fragment is further amplified using two 
short primers located on each of its extremities. 

Construction of fragment A 

The first PCR fragment, corresponding to 13 bases specifying restriction sites and a 
20 Kozak consensus motif followed by bases 1 to 1026 of the FmuvHmv was obtained by 
PCR assembly of 12 overlapping oligonucleotides (Fig 37). This fragment has been 
inserted in the pCRIITOPO cloning vector (Invitrogen). After sequencing the 
fragment, it was retrieved from the pCRIITOPO vector by a Xbal and TspRI 
digestion and a 963 bp fragment was further purified, leading to fragment A. 

25 

Construction of fragment B 

The second PCR fragment B corresponding to bases 965 to 1712 of FmuvHmv was 
obtained by assembling 9 oligonucleotides whose sequences can be found in Fig. 38. 
After its insertion into the pCRIITOPO vector and its sequencing, this 785 bp 
30 fragment has been recovered by a TspRI and Aval digestion. 
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Construction of fragment C 

The third PCR fragment C corresponding to bases 1712 to 2485 has been assembled 
starting from the 1 1 oligonucleotides shown in Fig 39. It has been inserted in the 
pCRIITOPO cloning vector and sequenced. This 774 bp fragment has been retrieved 
5 by an Aval and Apal digestion. 

Construction of fragment D 

The fourth PCR fragment D corresponding to bases 2485 to 3139 followed by 8 bp 
specifying a unique restriction site has been assembled starting from the 8 
10 oligonucleotides shown in Fig 40. This fragment has been inserted in the 
pCRIITOPO vector and sequenced. A 657 bp fragment has been recovered after an 
Apal and EcoRI digestion. 

Construction of the entire coding sequence 

15 The entire F MuV H Mv codon optimised coding sequence has been obtained by 
assembling fragment A, B, C, D and inserting the module digested by Xbal and 
EcoRI into the corresponding sites of the pEE14 vector (Fig. 41). The resulting 
plasmid, pEE 1 4F MuV humH Mv hum , encodes for a humanised sequence coding for aa 
1-482 of the mumps virus fusion protein followed by aa 59-617 of the measles 

20 virus. The humanised and original F MuV Hmv nucleic and amino acids sequences are 
shown in Fig. 42. 

iii) Purification and analysis of FHN expressed in CHO-KI 

25 a) Purification 

CHO cell line expressing secreted recombinant FHN was cultivated in cell factories in 
G-MEM medium supplemented with 2% FCS, in presence or absence of 1% Butyrate 
Na. FHN was purified by immunoaffinity chromatography by loading spent culture 
medium onto a Mabl9-sepharose column as described using the same experimental 
30 conditions. 
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When expressed in absence of Butyrate Na, purified FHN migrated on SDS-PAGE, in 
heating and reducing conditions, mainly as a band of 1 10 kDa. In contrast, FHN is 
visualized as a triplet of 1 10, 120 and 130 kDa when CHO cells are cultivated with 
butyrate. Heating has a more drastic effect than reducer on the FHN electrophoretic 
5 migration. Indeed, high molecular weight species are clearly detected in the 

preparation when electrophoresis proceeded without heating suggesting the presence 
of FHN aggregates or oligomers. These aggregates did not seem to be contaminated 
by CHO proteins. Antibodies directed to CHO proteins did not specifically recognize 
on Western blot any bands. Glycan analysis was performed using several lectins 
10 specific for different carbohydrate moieties. Surprisingly, FHN did not cany sialic 
acids or high-mannose structures but carbohydrates of galactose-acetyl-galactosamine 
type characteristic of hybrid N- and/or O-giycosylations. 

N-terminal microsequence analysis showed mainly the presence of Fl subunit in bands 
15 of 1 10-130kDa. The F2 N-terminal amino acid sequence detected in bands of lower 
and higher molecular weight indicated that some purified FHN molecules are present 
under a F0 form (non mature F). 

The presence of aggregates or oligomers in the FHN preparations was confirmed by 
20 gel filtration analysis and proteins were detected by laser-light scattering. Whatever the 
culture conditions (butyrate or not), between 50 and 65% of FHN populations 
displayed a molecular weight higher than 10 6 Da demonstrating that FHN is 
aggregated. 5 to 15% has a molecular weight ranging from 400 to 900 kDa whereas 
30 to 35% is monomelic FHN. 

25 

b) Serum immunoglobin analysis. 
Immunisation protocol 

The F RS vHNpiV3 protein was purified from the spent medium culture of the CHO-KI 
30 cells transfected by the recombinant pEEl 4 FRsvhumHNpjvshum by immunoaffinity 
chromatography as described (Purification of the recombinant product expressed in 
baculovirus recombinant infected SF9 cells). The product was injected in 7 groups of 
Balb CI mice as descibed in the following table 1. 
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Humoral response directed against the FHN protein 

The humoral response directed against the FHN protein was determined. To this end, 
ELISA plates were coated with immunoaffinity purified FHN protein. 
5 

Total IgG (Fig 43) 

To detect specific anti-FHN total IgG, ELISA plates were coated with 200ng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. Total IgG were detected using a biotinylated 
10 serum directed against mouse IgG. 

IgGl (Fig 44) 

To detect specific anti-FHN IgGl, ELISA plates were coated with lOOng of 
immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
15 mice second bleed sera were then applied. IgGl were detected using a biotinylated 
serum directed against mouse IgGl . 

IgG2a (Fig 45) 

To detect specific anti-FHN IgG2a, ELISA plates were coated with lOOng of 
20 immunoaffinity purified FHN protein, plates were then saturated and dilutionsof the 
mice second bleed sera were then applied. IgG2a were detected using a biotinylated 
serum directed against mouse IgG2a. 

The titer of each sera was determined and a mean titer for each group was calculated 
25 and is reported in table 2. These experiments show that the FHN antigen by itself or 
formulated with adjuvant (group 1 to 3), stimulates a specific humoral response. 
Indeed, no anti-FHN antibodies are generated in the untreated mice group (group 5) or 
in the group immunised solely with the adjuvant (group 4). The group 1 (and group 4) 
adjuvant was 3D-MPL and QS21 formulated with cholesterol containing liposomes as 
30 described in WO 96/33739; the group 2 adjuvant was alum. 

The IgGl/IgG2a ratio indicates the Thl or Th2 orientation of the immune response; 

(Table2), a protective response against both the RS V or the PiV3 should tend toward 
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the Thl type, that is a low IgGl/IgG2a ratio. In this regard, the responses generated 
with the FHN formulated in the presence of the 3D-MPL + QS2 1 adjuvant appears to 
be the more promising one. 

Table 1: Experimental procedures 

Immunogenicity FHN in 

mice 



Group 


n 


Vol 
(Ml) 


route 


Antigen 


Immuno- 


buffer 


preservative 










nature 


dose 
(Mg) 


stimulants 






1 


12 


2x50 


IM 


FHN 


2 


3D-MPL/ 
QS21 


PBS mod 
pH7.4 


thiomersal low 
(lug/ml) 


2 


12 


2x50 


IM 


FHN 


2 


Al(OH)3 


PBS mod 

pH 7.4 


thiomersal low 
(lug/ml) 


3 


12 


2x50 


IM 


FHN 


2 


/ 


PBS mod 
pH 7.4 


thiomersal low 
(lUg/ml) 


4 


12 


2x50 


IM 


/ 


/ 


3D-MPL/ 
QS21 


PBS mod 
pH 7.4 


thiomersal low 
(lfig/ml) 


5 


12 


/ 


/ 


untreated 


/ 


/ 


/ 


/ 


6 


12 


2x30 


INA 


RSV live 




/ 


/ 


/ 


7 


12 


2x30 


INA 


PIV-3 live 




I 


/ 


/ 



IM=intra-muscuiar 
INA=intra-nasal 
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Antigen 


cc. ng/ml 


Buffer 


RSV live 


6.2 

logPFU/ml 




PIV-3 
live 


6.7 

logPFU/ml 




FHN 


120 (2.5ml) 


PBS 

pH7.3 



Time schedule: 
5 Injection 1 = Day 0 
Injection 2 = Day 28 
First Bleed = Day 28 
Second bleed = Day 42 
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Table 2: Serum antibody response against FHN. 

The total IgG, IgGl and IgG2a was determined for each mouse sera. A mean titer for 
each group was then calculated and is reported in the table. 



group 
n° 


Immunogen 


Total IgG 


IgGl 


IgG2a 


IgGl/IgG2a 


1 


FHN + 3D- 
MPL/QS21 


1182000 


109800 


305500 


0.36 


2 


FHN + Alum 


182200 


127100 


4429 


28.7 


3 


FHN 


44990 


22760 


1941 


11.73 


4 


adjuvant=from 
group 1 


49 


32 


ND 


ND 


5 


untreated 


52 


ND 


ND 


ND 


6 


Live RS V 


12840 


748 


2718 


0.27 


7 


Live PiV3 


10860 


2758 


2320 


1.19 



ND=undetermined, the titer being to low 
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Claims 

1 . A process for preparing a heterochimeric protein or an immunogenic 
derivative thereof comprising an immunogenic fragment of the fusion (F) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV and an immunogenic fragment of the 

5 attachment (G, HN or H) protein of RSV, PIV1, PIV2, PIV3, MV or MuV which 
process comprises expressing recombinant DNA encoding the heterochimeric 
protein or immunogenic derivative thereof in CHO cells and recovering the protein. 

2. A process according to claim 1 wherein at least one non-preferred or less 

10 preferred codon in a natural gene or DNA encoding the said heterochimeric protein 
or immunogenic fragment thereof has been replaced by a preferred codon encoding 
the same amino acid. 

3. A heterochimeric protein or an immunogenic derivative thereof comprising an 
15 immunogenic fragment of the fusion (F) protein of RSV, PIV1, PIV2, PIV3, MV 

or MuV and an immunogenic fragment of the attachment (G, HN or H) protein of 
RSV, PIV1, PIV2, PIV3, MV or MuV, with the proviso that where one of the 
immunogenic fragments is derived from RSV F, RSV G or PIV3 F, PIV3 HN, the 
other of the immunogenic fragments is derived from MuV F, MuV HN, MV F, 
20 MV H, PIV1 F,PIV1 HN, PIV2 F or PIV2 HN. 

4. A process for preparing a heterochimeric protein or immunogenic derivative 
thereof as claimed in claim 3 which process comprises expressing recombinant 
DNA encoding the heterochimeric protein or immunogenic derivative thereof in 

25 either one of; CHO cells or insect cells and recovering the protein. 

5. A protein according to claim 3 wherein the immunogenic fragment of the F 
protein is lacking the membrane anchor domain at its C-terminal end. 

30 6. A protein according to claims 3 or 5 wherein the immunogenic fragment of the 
G, HN or H protein is lacking the signal/anchor domain at its N-terminal end. 
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7. A protein according to any one of claims 3, 5 or 6 which is linked via an amino 
acid in the C- terminal part of the immunogenic fragment of the F protein of RSV, 
PIV1, PIV2, PIV3, MV or MuV to an amino acid in the N-terminal part of the 
immunogenic fragment of the G protein of RSV or the HN protein of PIV1, PIV2, 

5 PIV3, MuV or the H protein of MV. 

8. A protein according to any one of claims 3, 5, 6 or 7 which commences at its N- 
terminal end with a signal sequence from the F protein of RSV, PIV1, PIV2, PIV3, 
MV or MuV. 

10 

9. A protein according to any one of claims 3,5,6 or 7 which commences at its N- 
terminal end with a signal sequence from TP A. 

10. A protein according to any one of claims 3 or 5 to 8 which comprises a 
15 ubiquitin leader sequence. 

11. A protein according to any one of claims 3 or 5 to 9 which comprises a 
polyhistidine tail. 

20 12. A protein according to claim 10 or 1 1 which comprises a cleavage site for 
cleaving off the ubiquitin leader sequence and/or the polyhistidine tail. 

13. A heterochimeric protein according to any one of claims 3 or 5 to 1 1 which is 
selected from the group consisting of: 

25 Fs + a-RSVxHNsaMuV; 

Fs + a' PIV3 x HNs'a' MuV; 
Fs + a'MuV xGsaRSV; or 
Fs + aMuVxHNsaPIV3,or 
an immunogenic derivative thereof. 

30 

14. A heterochimeric protein according to any one of claims 3 or 5 to 11 which is 
selected from the group consisting of: 
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Fs + a*MuVxHsa*MV;or 
FsVRSVx HNsaPIVl, or 
Fs + a RSVx HNs a PIV2, or 
an immunogenic derivative thereof. 

5 

15. A heterochimeric protein which is: 

Fs + a (1-526) RSV x HNs'a* (70-572) PIV3, 
Fs + a* (1-492) PIV3 x Gs a* (69-298) RSV, 
Fs + a (1-526) RSV x HNsa* (70-572) PIV3 bis, 
10 Fs + a (1-526) RSV x HNs a' (70-572) PIV3 ent his, or 

sTPA (1-21) UB (1-74) ent Fs a' (24-526) x HN sa(70-572) PIV3, or 
an immunogenic derivative thereof. 

16. Recombinant DNA encoding a heterochimeric protein or an immunogenic 
15 derivative thereof according to any one of claims 3 or 5 to 15. 

17. Recombinant DNA according to claim 16 in which at least one non-preferred 
or less preferred codon in the DNA has been replaced by a preferred codon 
encoding the same amino acid. 

18. DNA which hybridises under conditions of high stringency with the DNA of 
20 claim 16 or 17. 

19. An expression vector comprising recombinant DNA according to claims 16 to 
18. 

20. A host transformed with DNA according to any one of claims 16 to 18 or with 
a vector according to claim 19. 

25 21. A host according to claim 20 which is a CHO cell. 

22. A host according to claim 21 which is an insect cell. 
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23. A vaccine composition comprising a protein according to any one of claims 3 
or 5 to 13 or an immunogenic derivative thereof in admixture with a 
phannaceutically acceptable carrier. 

24. A vaccine composition according to claim 23 further comprising 3D 
5 Monophosphoryl lipid A and/or QS-21 . 

25. A vaccine composition according to claims 23 or 24 wherein the carrier is an 
oil-in-water emulsion. 

26. A heterochimeric protein or an immunogenic derivative thereof according to 
any one of claims 3 or 5 to 15 for use in medicine. 

10 27. A process for the production of a heterochimeric protein according to any one 
of claims 3 or 5 to 15 which process comprises expressing recombinant DNA 
encoding said protein or immunogenic fragment thereof in a host cell and 
recovering the protein. 

28. A method of treating a human or animal susceptible to paramyxoviridae viral 
15 infections comprising administering an effective amount of a vaccine according to 

any one of claims 23 to 25. 

29. Use of a protein or an immunogenic derivative thereof according to any one of 
claims 3 or 5 to 15 in the manufacture of a medicament for use in the treatment of 
respiratory disorders. 
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A) Synthetic adaptators 

5' C ATG AAT GAT CAA GGC TTG AGC AA 3' 

TTA CTA GTT CCG AAC TCG TTA GTC 
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A) Synthetic adaptators 
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A) Synthetic adaptators 
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A) Synthetic adaptators 
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CAA ATG ATC AAG GCT TGA GCA A 3' 
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A) Synthetic adaptators 

5' C ATG AAC AAT GAG TTT ATG GAA GTT ACA GAA AAG ATC CAA 
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C) pEE14 F s + a' RSV xHN s'a* PiV3bis 



BclI/BamHI 



Frsv 1-526 



ATG 



BamHI /Bell 




HN PIV3 70-572bis 



^— pEE14 



STOP 
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Fig. 19 

A) Synthetic adaptators 

PstI 5'GT GAC GAT GAC GAT AAG CAT CAT CAT CAT CAT CAT TAG 
ACGTC ACA CTG CTA CTG CTA TTC GTA GTA GTA GTA GTA GTA ATC 

GGATCCGCATG 3' 

CCTAGGC SphI [SEQ ID NO: 8] 



B) pNIV3340 



Xhol PstI SphI BamHI 





HN P]V3 1-572 


Aspx4Lys Hisx6 


V 






t 



ATG 

C)pNTV4l20 

BamHI 



STOP 
Xhol BamHI 



1 



Frsv 1-526 



HN PIV3 70-572 



ATG 

C) pEE14 F s + a RSV xHN s'a PiV3 enthis 

BamHI Xhol 



STOP 



pIBI 



pUC19 



1 



1 



BamHI 



1-526 



HN PIV3 70>§^ 



Aspx4Lys isx6 



J pEE14 



t 

ATG 



STOP 
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Fig. 20 



A) pNIV3475 

EcoRI Xbal 



Aflll BamHI 



Saccharomyces cerevisiae Ubiquitin 



:?L-pUC19 



ATG 



Gly 76 



B) Synthetic adaptators 



5' CT AGC ATG CAG ATC TTC GTC AAG ACG TTA ACC GGT AAA ACC 
Nhel G TAC GTC TAG AAG CAG TTC TGC AAT TGG CCA TTT TGG 



ATA ACC y Xbal 
TAT TGG ATCT 



[SEQ ID NO: 9] 



C) pNIV4122 



HindHI Nhel Xbal 



Aflll BamHI 



aa 1-21 sTPA a-S 



aa 1-76 Ubiquitin 



-pBluescript 



ATG 



ATG 



r 

Gly76 
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Fig. 21 
A) pNIV4122 



HindHI 



Aflll BamHI Spel 



aa 1-21 sTPA 



Ala-Ser 



aa 1-76 Ubiquitin 



IpBluescript 



ATG 



Gly76 



B) Synthetic adaptators 

5 ! TTA AGA CTA AGA GAC GAT GAC GAT AAG TCC AGT CAA AAC 
Aflll CT GAT TCT CTG CTA CTG CTA TTC AGG TCA GTT TTG 

ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTC AGC 
TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG TCA CGT CAG TCG 

AAA GGC TAT CTT AGT GCT CTA AGA ACT GGT TGG TAT A3* Spel 
TTT CCG ATA GTT TCT CGA GAT TCT TGA CCA ACC ATA TGA TC 



[SEQ ID NO: 10] 



C) pNIV4123 



Hindlll Aflll Spel 



sTPA 


a-s 


aa 1-74 


enteroK 


aa 24-55 F^y 






Ubiquitin 





pBluescript 



ATG 
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Fig. 22 

A)pNIV4123 
Hindlll 



Spel 



1 








r 


sTPA 


a i aa 1 -74 
s jUbiquitin 


entero 
K 


aa 24-55 
Frsy 







r 



ATG ATG 



r 

ATG 



B)pNIV4120 

Xbal Spel 



Frsv 1-526 



HN PIV3 70-572 



ATG 



STOP 



C) pNIV4124 
Xbal 



Spel 

I 



sTPA 



aa 1-74 
Ubiquitin 



entero jaa 24-55 

K i F 

I r RSV 



Frsv 56-526 



.pBluescript 



EcoRI 



-pUC19 



EcoRI 



HN P1V3 70-572 



ATG 



-pUC19 



Stop 



D) pEE14 sTPA UBI EN Fsa" RSV x HN sa PiV3 



Xbal 



Spel 

I 



EcoRI 



STPA 


a 


aa 1-74 


entero 


aa 24-55 




s 


Ubiquitin 


K 


Frsv 











Frsv 56-526 



HNp IV3 70-572 



±_pEE14 



T 

ATG 



I 

Stop 
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A)pNIV4120 



BamHI 



BamHI 



Frsv 1-526 




HN PIV3 70-572 



.pUC19 



ATG 



STOP 



B)pNIV4132 



BamHI 



BamHI 



Polyhedrin 
promoter 



Frsv l" 526 




HN PIV3 70-572 



P AcUW5 1 
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Fig. 24 



A)pNIV4120 



24/73 



Spel 



RSV 



1-526 



ATG 



BamHI 




HN PIV3 70-572 



-pUC19 



STOP 



B) Synthetic adaptators 

5 'GAT CAA AAC ATC ACT GAA GAA TTT TAT CAA TCA ACA TGC 
BamHI TT TTG TAG TGA CTT CTT AAA ATA GTT AGT TGT ACG 

AGT GCA GTC AGC AAA GGC TAT CTT AGT GCT CTA AGA ACT 
TCA CGT CAG TCG TTT CCG ATA GAA TCA CGA GAT TCT TGA 

GGT TGG TAT A 3 'Spel 

CCA ACC ATA TGA TC [SEQ ID NO: 11] 



C) pNTV4136 



BamHI Spel BamHI 









pAcUWSl 


Polyhedrin 
promoter 




gp67 
signal 


Frsv 25-526 m HN PIV3 70-572 
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Fig 25: SDS-PAGE (reduced conditions) of the F RSV HN PiV3 protein purified by 
immunoaffinity from the spent culture medium of the recombinant baculovims 3546. 

kDa: molecular weight marker 
A: Coomassie blue staining 

B: Western blot revealed by a goat polyclonal anti-RSV serum 20 RG45 




SUBSTITUTE SHEET (RULE 26) 
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Fig . 26: Codon usage of F RSV HNp iV 3 and highly expressed human genes (hum high exp) 
showing frequencies (xlOO) of the individual codons for each of the degenerately encoded 
amino acids, and the most prevalent codon in bold. 
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GAT 
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Glu 



GAG 


75 


22 


A 


25 


77 



Gly 



CjGG 24 22 

A 14 37 

T 12 22 

C 50 19 



His 



CAT 


21 


86 
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79 
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lie 



ATA 
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Lys 



AAG 
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Pro 
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> 




im high e; 


> 

z 

ac 
> 




-i 
j= 








Leu 
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Fig. 27: Schematic diagram of the PCR synthesis of each fragment showing unique 
restriction sites along the sequence (black dots) and restriction sites (A and B) that allow 
retrieval of the full size fragment from the cloning vector. 




PCR 



ligation and insertion into the pcrllTOPO 
cloning vector. 



Sequencing 



Retrieval from the pcrllTOPO vector by restriction with 
enzyme A and B. 
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Fig . 28: Sequence of the 18 oligonucleotides from which PCR fragment A was generated. 

1) olf huml . seq, bases 1-90 of F RS vHN P iv3, homologous to mRNA 

5' cccTCTAGAGGATCCACCATGGAGCTGCTGATtttaAAGACCAACGCCATCACCGCCATCCTG 
GCCGCGGTGACCCTCTGCTTCGCGTCC 

2) olf hum2 . seq, bases 75-165 of F RSV HNpiv3, inverse complementary to 
mARN 

5 ' CCTCAGCGCGCTCAGGTAGCCCTTGCTGACAGCagaGCAGGTGGACTGGTAGAACTCCTCGGTG 
ATGTTCTGGCTGGACGCGAAGCAGAGG 

3) olfhum3.seq, bases 150-240 of F RS vHNp iV 3r homologous to mRNA 
5'CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAG 
GAGAACAAGTGCAACGGCACCGACGCC 

4) olfhum4.seq, bases 225-310 of F RS vHNp iV 3, inverse complementary to 
mARN 

5 ' GCATCAGCAGCTGCAGCTCGGTCACGGCGCTCTTGTACTTGTCCAGCTCCTGCTTGATCAGCTT 
CACCTTGGCGTCGGTGCCGTTG 

5) olfhumS.seq, bases 295-397 of F RSV HNpiv3, homologous to mRNA 

5 ' CTGCAGCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGT 
TCATGAACTACACCCTCAACAACACCAAGAACACCAACG 

6) olfhum6.seq, bases 378-496 of F RS vHNp iV 3, inverse complementary 
to mRNA 

GGTGCAGGACCTTGGACACCGCGATGCCGCTGGCGATGGCGGAGCCCACGCCCAGCAGGAAGCCCA 
GGAAgCGcctCTTgCgCTTCTTGCTCAGGGTCACGTTGGTGTTCTTGGTGTTG 

7) olfhum7.seq, bases 480-561 of F RS vHNp iV 3, homologous to mRNA 

5 ' GTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAAC 
AAGGCGGTGGTCAGCCTG 

8) olfhum8.seq, bases 543-633 of F R svHN Pi v3, inverse complementary 
to mARN 

5 ' GGGGAGCAatTGCTTGTCGATGTAGTTCTTGAGGTCCAGCACCTTGCTGGTCAGCACGCTCACG 
CCGTTGGACAGGCTGACCACCGCCTTG 

9) olfhum9.seq, bases 609-676 of F R svHN Pi v3, homologous to mRNA 
5'CTACATCGACAAGCAatTGCTCCCCATCGTGAACAAGCAGtcCTGCAGCATCTCTAACATTGAG 
ACCG 

10) olfhumlO.seq, bases 653-732 of F RSV HNpiv3, inverse complementary 
to mARN 

5'GCTGAACTCCCTGGTGATCTCCAGCAGCCTGTTGTTCTTCTGCTGGAACTCGATCACGGTCTCA 
ATGTTAGAGATGCTGC 
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11) olf humll . seq, bases 714-787 of F RS vHN Pi v3/ homologous to mRNA 

5 ' GATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTACATGCTGACC 
AACAGCGAGC 

12) olfhuml2 . seq, bases 768-846 of F RSV HNpi V 3, inverse complementary 
to mARN 

5 ' GTTGGACATaAGCTTCTTCTGGTCGTTGGTGATGGGCATGTCGTTGATCAGGGACAGCAGCTCG 
CTGTTGGTCAGCATG 

13) olfhuml3 . seq, bases 825-916 of F R svHN Pi v3, homologous to mRNA 
5'CCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagC 
ATCATCAAGGAGGAGGTGCTGGCCTACG 

14) olfhum!4 . seq, bases 900-990 of F R svHNp iv3/ inverse complementary 
to mARN 

5 ' GGTGGTGCACAGGGGGGAGGTGTGCAGCTTCCAGCAGGGGGTGTCGATCACGCCGTACAGGGGC 
AGCTGCACCACGTAGGCCAGCACCTCC 

15) olfhumlS . seq, bases 975-1065 of FRsvHNp iV 3, homologous to mRNA 

5 ' CCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGC 
TGGTACTGCGACAACGCCGGCTCCGTG 

16) olfhuml6- seq, bases 1048-1133 of F^vHNpivs/ inverse 
complementary to mARN 

5 ' CTGTTCATGGTGTCGCAGAACACGCGGTTGGACTGCACCTTGCAGGTCTCCGCCAGGGGGAAGA 
AGGACACGGAGCCGGCGTTGTC 

17) olfhuml7 . seq, bases 1116-1210 of FrsvHNpivs/ homologous to mRNA 

5 ' CTGCGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACATCGACATCTTC 
AACCCCAAGTACGACTGCAAGATtATGacctcc 

18) olfhuml8 .seq, bases 1195-1295 of F RS vHNp iV 3/ inverse 
complementary to mARN 

gggaattctgtacacttggtcttgccgtagcaggacacgatggcgcccagggaggtgatcacggag 
ctgctcacgtcggtcttggaggtCATaATCTTGCAG 



[SEQUENCES ABOVE ARE SEQ ID NOs: 12 to 29, respectively] 



WO 00/18929 



PCT/EP99/07004 



30/73 



Fig. 29: Construction of pEE14 FRsyhurnHNp^ 



a) PCR fragment A 



Xbal 



BsrGI 



F RSV hum 1-1264 



ATG 



b) pNIV4120 +PCR fragment A 

BsrGI 

Xbal 



EcoRI 















F RSV hum 1-1264 


F RSV 1 265-1 578XHNpi V3 







-pUC19 



c) pEE14 F RSV hum HN PjV 3 



BsrGI 



Xbal 



F RSV hum 1-1264 



EcoRI 



F RSV 1265-1578XHN K 



PiV3 _ 



-pEE14 
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Fig. 30: Sequence of the 10 oligonucleotides from which PCR fragment B was generated. 

1) olfhnhuml9. seq, bases 1269-1353 of F RSV HNpiv3f homologous to mRNA 
5 ' cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaagaccttctccaacgg 
ctgcgactacgtgtccaacaag 3 ! 

2 ) olf hnhum20 . seq, bases 1336-1428 of F RS vHN P iv3, inverse 
complementary of mARN 

5' cttcacgtacaggctcttgccctcctgcttgttcacgtagtacagggtgttgcccacggacac 
ggtgtccacgcccttgttggacacgtagtc 3 ' 

3) olf hnhum21 . seq, bases 1413-1497 of F R svHNp iV 3, homologous to mARN 
5 1 gagcctgtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcccctccga 
cgagttcgacgcctccatctccc 3 f 

4 ) olf hnhum22 . seq, bases!483-1599 of F^yHNpivs, inverse 
complementary of mARN 

5 1 gttcatgatgttggtggtggacttgccggcgttcacgttgtgcagcagctcgtcggacttgcg 
gatgaaggccaggctctggttgatcttctcgttcacctgggagatggaggcgtc 3 1 

5) olfhnhum23.seq, bases 1581-1691 of FusvHNpivs/ homologous to mARN 

5 ' caccaccaacatcatgaacaacgagttcatggaggtgaccgagaagatccagatggcctccga 
caacatcaacgacctgatccagtccggcgtgaacacccggctgctgac 3 ' 

6) olf hnhum2 4 . seq, bases 1677-1779 of F RS vHN Pi v3, inverse 
complementary of mARN 

5 1 gatggtgatctcgctgatgaacttccgcaggtcggacatctgctgggtcagggagatggggat 
gtagttctgcacgtggctctggatggtcagcagccgggtg 3 1 

7 ) olf hnhum25 . seq, bases 1761-1865 of F RS vHNp iV 3, homologous to mRNA 
5 ' catcagcgagatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacccacga 
cgtgggcataaagcccctgaaccccgacgacttctggcgctg 3 1 

8) olfhnhum26. seq, bases 1849-1967 of F R svHNp iV 3, inverse 
complementary of mARN 

5 ' gtgcgcacgcagccgtccacggtggtgggcatggccagcaggccgggcccgggcatcagcctt 
atcttgggggtcttcatcagggaggggaggccggaggtgcagcgccagaagtcgtc 3 1 

9) olfhnhum27 . seq, bases 1953-2059 of F RS vHNp iV 3/ homologous to mRNA 
5 ' cggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacgcctacacctccaacct 
gatcacccgcggctgccaggacatcggcaagtcctaccaggtgc 3 1 

10) olfhnhum28 . seq, bases 2043-2154 of FrsvHNpivs/ , inverse 
complementary of mARN 

5 ' ggacttcctgttgtcgttgatgttgaaggtgtgggagatccgggggttcaggtcgggcaccag 
gtcggagttcacggtgatgatgccgatctgcagcacctggtaggacttg 



[SEQUENCES ABOVE ARE SEQ ID NOs : 30-39, respectively] 
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Fig. 31: Sequence of the 16 oligonucleotides from which PCR fragment C was generated. 

1 ) olfhnum29 . seq, bases 2139-2229 of FrsvHNp^, homologous to mRNA 

5 1 cgacaacaggaagtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctccac 
gcccaaggtggacgagcgctccgactac 3 1 

2) olfhnhum30 . seq, bases 2214-2307 of F RSV HNpiv3f inverse 
complementary to mRNA 

5' gttcttgaagcgggtggtggagatggagccgtcgtggttgacgatgtccagcacgatgtcctc 
gatgccggagctggcgtagtcggagcgctcg 3 1 

3) olfhnhum31. seq, bases 2292-2398 of F RSV HNpiv3/ homologous to mRNA 
5' cacccgcttcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccctccgt 
gggccccggcatctactacaagggcaagatcatcttcctgggc 3 1 

4 ) olfhnhum32 . seq, bases 2382-2472 of F RS vHNpiv3, inverse 
complementary to mRNA 

5' ccgctgggtcttgccggggcacccggtggtgttgcagatggcgttctcgttgatggggtgctc 
caggccgccgtagcccaggaagatgatc 3 ' 

5) olfhnhum33. seq, bases 2457-2549 of F RS vHNpi V 3, homologous to mRNA 
5 ' cggcaagacccagcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgcat 
ggtgaactccatcatcgtggtggacaaggg 3 ' 

6) olf hnhum34 . seq, bases 2532-2643 of F RS vHN Pi v3/ inverse 
complementary to mRNA 

5 ' cttgttgcccagcagcagcaggcggccctcggagccccagtagttctgccgcatggagatggt 
ccacaccttcagcttggggatggagttcaggcccttgtccaccacgatg 3 ' 

7) olfhnhum35. seq, bases 2628-2726 of FRsvHNpi V 3/ homologous to mRNA 
5 1 gctgctgggcaacaagatctacatctacacccgctccaccagctggcacagcaagctgcagct 
gggcatcatcgacatcaccgactacagcgacatccg 3 ' 

8) olfhnhum36. seq, bases 2710-2781 of F RS vHNp iV 3, inverse 
complementary to mRNA 

5' ggggcactcgttgttgccgggccggctcagcacgttgtgccaggtccacttgatgcggatgtc 
gctgtagtc 3' 

9) olf hnhum37 . seq, bases 2765-2836 of F RS vHNpi V 3, homologous to mRNA 
5' gcaacaacgagtgcccctggggccactcctgccccgacggctgcatcaccggcgtgtacaccg 
acgcctacc 3 1 

10) olfhnhum38 .seq, bases 2820-2889 of F RS vHNpiV3, inverse 
complementary to mRNA 

5 ' cttctgggagtccaggatcacggagctcacgatgctgccggtggggttcagggggtaggcgtc 
ggtgtac 
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11) olfhnhum39. seq, bases 2874-2943 of F RS vHNp iV 3, homologous to mRNA 
5 * cctggactcccagaagtcccgggtgaaccccgtgatcacctacagcacctccaccgagcgcgt 

gaacgag 

12) olfhnhum40.seq, bases 2927-2994 of F RS vHN Pi v3f rom: 1 to: 68, 
inverse complementary to mRNA 

5' gcagctggtggtggtgtagccggcgctcagggtcttgttgcggatggccagctcgttcacgcg 
ctcgg 

13) olfhnhum41.seq, bases 2979-3043 of F RS vHN PiV 3/ homologous to mRNA 
5' caccaccaccagctgcatcacccactacaacaagggctactgcttccacatcgtggagatcaa 

cc 

14) olfhnhum42.seq, bases 3027-3085 of F RS vHNpiv3, inverse 
complementary to mRNA 

5 ■ cggtcttgaacagcatgggctggaaggtgtccaggctcttgtggttgatctccacgatg 3 1 

15) olfhnhum43. seq, bases 3069-3114 of F RS vHN P1V 3, homologous to mRNA 
5' catgctgttcaagaccgagatccccaagagctgcagctaaGAATTC 3 ' 



[SEQUENCES ABOVE ARE SEQ ID NOs : 40-54, respectively] 
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Fig. 32 : Construction of pEE14F RSV hum HN m3 bum 

a) pNIV4120 +PCR fragment A 



BsrGI 



Xbal 



F RSV^um 1-1264 



EcoRJ 



F RSV I265-1578XHN 



PiV3- 



-pUC19 



b) PCR fragment B 



BsrGI 



Kpnl 



Frsv HN PjV 3 1 264-2 1 3 6hum 



c) PCR fragment C 



Kpnl 



EcoRI 



Frsv HNpiV32136- 
3090hum 



d) pEE14 F RSV hum HNpjv, hum 



Xbal 



EcoRI 



F RSV humfINp jV 3hum 



pEE14 



WO 00/18929 



PCT/EP99/07004 



35/73 

Fig . 33A : Humanized nucleic acids sequence of F RSV HN PiV3 (upper sequence) compared to 
the original sequence found in the pNIV4120. 

7 AGAGGATCC ACCATGGAGCTGCTGATtttaAAGACCAACGCCA 4 9 

MINIMI I I I I I I 1 I I I I I I I I I I I I I I I I I I 

22 62 AGAGGATCCCCCGGGTAccatggagttgctaatcctcaaaacaaatgcaa 2311 

50 TCACCGCCATCCTGGCCGCGGTGACCCTCTGCTTCGCGTCCAGCCAGAAC 99 

I I I I I I I I I I I II II I I II I I I I I II II I II M II III 
2312 ttaccgcaatccttgctgcagtcacactctgttttgcttccagtcaaaac 2361 

100 ATCACCGAGGAGTTCTACCAGTCCACCTGCtctGCTGTCAGCAAGGGCTA 14 9 

Mill II M M M M I I M Ml Ml I I I I I I I I I I I I I 
2362 atcactgaagaattttatcaatcaacatgcagtgcagtcagcaaaggcta 2411 

150 CCTGAGCGCGCTGAGGACGGGGTGGTACACtAGtGTGATCACCATCGAGC 199 

II II II II II II II Mill I I I I I II I II II II II 
2412 tcttagtgctctaagaactggttggtatactagtgttataactatagaat 24 61 

200 TGAGCAACATCAAGGAGAACAAGTGCAACGGCACCGACGCCAAGGTGAAG 24 9 

I M M MINIM II I I I I I II M II II I I I Mill II 
24 62 taagtaatatcaaggaaaataagtgtaatggaacagacgctaaggtaaaa 2511 

250 CTGATCAAGCAGGAGCTGGACAAGTACAAGAGCGCCGTGACCGAGCTGCA 2 99 

I I II II II I I I II II II II I! M M M II I I I I 
2512 ttgataaaacaagaattagataaatataaaagtgctgtaacagaattgca 2561 

« • • • • 

300 GCTGCTGATGCAGAGCACCCCCGCCACCAACAACagaGCCAGGCGCGAGC 34 9 

I MM INN I II I I II II I I I II I I I II I I II I I II ! 
2562 gttgctcatgcaaagcacaccggcaaccaacaatcgagccagaagagaac 2611 

* * • ■ * 

350 TGCCCAGGTTCATGAACTACACCCTCAACAACACCAAGAACACCAACGTG 399 

I M I I I M I II M II II I I M I I I I Mill II I II I I II 
2612 taccaaggtttatgaattatacactcaacaataccaaaaataccaatgta 2661 

400 ACCCTGAGCAAGAAGcGcAAGaggCGcTTCCTGGGCTTCCTGCTGGGCGT 44 9 

M I II I M I M I II II I II I I I I I M I I I I I I I 
2 662 acattaagcaagaaaaggaaaagaagatttcttggctttttgttaggtgt 2711 

4 50 GGGCTCCGCCATCGCCAGCGGCATCGCGGTGTCCAAGGTCCTGCACCTGG 4 99 

II M II I I I I II II I I II I II II II I I M I I I I I I I I I I I 
2712 tggatctgcaatcgccagtggcattgctgtatctaaggtcctgcacctag 2761 



500 AGGGGGAGGTGAACAAGATCAAGAGCGCCCTGCTCTCCACCAACAAGGCG 54 9 

I II I M I I I I II II Mill II II II II Mill I I I I I I M 
2762 aaggggaagtgaacaaaatcaaaagtgctctactatccacaaacaaggct 2811 
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550 GTGGTCAGCCTGTCCAACGGCGTGAGCGTGCTGACCAGCAAGGTGCTGGA 599 
II MINI I I 1 II II II II II I I I I I I I II III I II 
2812 gtagtcagcttatcaaatggagttagtgtcttaaccagcaaagtgttaga 28 61 

600 CCTCAAGAACTACATCGACAAGCAatTGCTCCCCATCGTGAACAAGCAGt 64 9 
I I I I M I II I I II II I I I III I II II I I I M I I I I I I 
28 62 cctcaaaaactatatagataaacagttgttacctattgtgaacaagcaaa 2 911 

650 cCTGCAGCATCTCTAACATTGAGACCGTGATCGAGTTCCAGCAGAAGAAC 699 
(II I I I I I II I I! I I I I I II I I I I I I I I I I II I II I I I I I I 
2 912 gctgtagcatatcaaacattgaaactgtgatagagttccaacaaaagaac 2 961 

7 00 AACAGGCTGCTGGAGATCACCAGGGAGTTCAGCGTGAACGCgGGcGTcAC 749 
I I I I I II II I I I I I I I I I I I I I II II II II II II II II 
2962 aacagactactagagattaccagggaatttagtgttaatgcaggtgtaac 3011 

7 50 CACCCCGGTGAGCACCTACATGCTGACCAACAGCGAGCTGCTGTCCCTGA 7 99 
II I! II Mill II II! I II II II II I III II 
3012 tacacctgtaagcacttatatgttaacaaatagtgaattattatcattaa 3061 

800 TCAACGACATGCCCATCACCAACGACCAGAAGAAGCTtATGTCCAACAAC 849 

I I I I II Mill li II II II I I II I III I I I I I I I II I I I 
30 62 tcaatgatatgcctataacaaatgatcagaaaaagttaatgtccaacaat 3111 

850 GTGCAGATCGTGCGCCAGCAGAGCTACagCATCATGagCATCATCAAGGA 8 99 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
3112 gttcaaatagttagacagcaaagttactctatcatgtccataataaagga 3161 



900 GGAGGTGCTGGCCTACGTGGTGCAGCTGCCCCTGTACGGCGTGATCGACA 94 9 

I I I I I I I I I I I I I I I I I II II II II II II II I 
3162 ggaagtcttagcatatgtagtacaattaccactatatggtgtaatagata 3211 

950 CCCCCTGCTGGAAGCTGCACACCTCCCCCCTGTGCACCACCAACACCAAG 999 

I II II I I I I I I I I I I I I I I II II II II II I I II I I I I II I 
3212 caccttgttggaaactgcacacatcccctctatgtacaaccaacacaaag 3261 

1000 GAGGGCTCCAACATCTGCCTGACCCGCACCGACCGGGGCTGGTACTGCGA 104 9 

II II I I I I I I I I I I I I II I MINI 1 II II I I I I II II 
3262 gaagggtccaacatctgtttaacaagaaccgacagaggatggtactgtga 3311 

1050 CAACGCCGGCTCCGTGTCCTTCTTCCCCCTGGCGGAGACCTGCAAGGTGC 1099 

III II II II II II I I I II I II II II II II II II II I 
3312 caatgcaggatcagtatctttcttcccactagctgaaacatgtaaagttc 3361 

1100 AGTCCAACCGCGTGTTCTGCGACACCATGAACAGCCTGACCCTGCCCAGC 114 9 
I I I I I I I I I I I I I I I I II I I M I I I I III I I I I I 
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3362 aatcgaatcgagtattttgtgacacaatgaacagtttaacattaccaagt 3411 

1150 GAGGTGAACCTCTGCAACATCGACATCTTCAACCCCAAGTACGACTGCAA 1199 

II f I II II I I I I I I I I I Mill I M I I I I I I I I II II Mill 
3412 gaagtaaatctctgcaacattgacatattcaaccccaaatatgattgcaa 34 61 

1200 GATtATGacctccaagaccgacgtgagcagctccgtgatcacctccctgg 124 9 

I I I I I I I I I I II II II II I I I I I I I I I I I I I I I I II II I 
34 62 aattatgacttcaaaaacagatgtaagcagctccgttatcacatctctag 3511 

* • • * • 
1250 gcgccatcgtgtcctgctacggcaagaccaagtgtacagcctccaacaag 12 99 

I Mill I I I II I I I I I Mill II II II II I I I E Mill II 

3512 gagccattgtgtcatgctatggcaaaactaaatgtacagcatccaataaa 3561 

1300 aaccgcggcatcatcaagaccttctccaacggctgcgactacgtgtccaa 134 9 

II II M Mill I II I I II II Mill II II II II M II 
3562 aatcgtggaatcataaagacattttctaacgggtgtgattatgtatcaaa 3611 

* ■ * • * 

1350 caagggcgtggacaccgtgtccgtgggcaacaccctgtactacgtgaaca 1399 

I I M I I II I I I I I II I I I I I II I I II I I I I I II I I I 
3612 taagggggtggacactgtgtctgtaggtaatacattatattatgtaaata 3661 

1400 agcaggagggcaagagcctgtacgtgaagggcgagcccatcatcaacttc 14 4 9 

I I I I II I I I I I II II II II II M II II II M II Ml 
3662 agcaagaaggcaaaagtctctatgtaaaaggtgaaccaataataaatttc 3711 

1450 tacgacccgctggtgttcccctccgacgagttcgacgcctccatctccca 14 99 

1 I I I I II I II I I I II II I I I I II II I I II II I I I I II 
3712 tatgacccattagtgttcccctctgatgaatttgatgcatcaatatctca 37 61 

1500 ggtgaacgagaagatcaaccagagcctggccttcatccgcaagtccgacg 154 9 

II II II II I M I I I I I I II I I I I II II II M M I I I I I I 

37 62 agtcaatgagaagattaaccagagcctagcatttattcgtaaatccgatg 3811 

1550 agctgctgcacaacgtgaacgccggcaagtccaccaccaacatcatgaac 1599 

I I .1 II I I II I I I I I I II II I I II I I II I II I M I I I 
3812 aattattacataatgtaaatgctggtaaatccaccacaaatatcatgAAC 3861 

1600 aacgagttcatggaggtgaccgagaagatccagatggcctccgacaacat 164 9 

II I I I II II I II II II II I II I I I I I II I I I II II II II 

38 62 AATGAGTTTATGGAAGTTACAGAAAAGATCCAAATGGCATCGGATAATAT 3911 

■ • * ■ > 

1650 caacgacctgatccagtccggcgtgaacacccggctgctgaccatccaga 1699 

II I I I I II I I I I I I I II I I I II I II I II II II I M I 
3 912 TAATGATCTAATACAGTCAGGAGTGAATACAAGGCTTCTTACAATTCAGA 3961 
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17 00 gccacgtgcagaactacatccccatctccctgacccagcagatgtccgac 17 4 9 

I II II INN II II II II II I I I I I I II I I I I I I I 

3962 GTCATGTCCAGAATTATATACCaATATCATTGACACAACAAATGTCGGAT 4 011 

• • • • . 

17 50 ctgcggaagttcatcagcgagatcaccatccggaacgacaaccaggaggt 17 99 

II M I I I I II I II i I I ! I I M II I I I I II I I I I I I 

4 012 CTTAGGAAATTCATTAGTGAAATTACAATTAGGAATGATAATCAAGAAGT 4061 

18 00 gcccccccagaggatcacccacgacgtgggcataaagcccctgaaccccg 18 4 9 

III II II II II II II II I I I I I I I I I II II I II II I 

40 62 GCCTCCACAAAGAATAACACATGATGTGGGCATAAAACCTTTAAATCCAG 4111 

18 50 acgacttctggcgctgcacctccggcctcccctccctgatgaagaccccc 18 99 

I II II I II I I I I II II II II II II I MM! II II 
4112 ATGATTTTTGGAGATGCACGTCTGGTCTTCCATCTTTAATGAAAACTCCA 4161 

1900 aagataaggctgatgcccgggcccggcctgctggccatgcccaccaccgt 194 9 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
4162 AAAATAAGGTTAATGCCGGGGCCGGGATTATTAGCTATGCCAACGACTGT 4211 

1950 ggacggctgcgtgcgcaccccctccctggtgatcaacgacctgatctacg 1999 

II Mill II I II II III I II II II II Mill II I 
4212 TGATGGCTGTGTTAGAACTCCGTCCTTAGTTATAAATGATCTGATTTATG 42 61 

2000 cctacacctccaacctgatcacccgcggctgccaggacatcggcaagtcc 204 9 

I M II I II M II II II II II I I II M II M II II M 

42 62 CTTATACCTCGAATCTAATTACTCGAGGTTGCCAGGATATAGGAAAATCA 4 311 

2050 taccaggtgctgcagatcggcatcatcaccgtgaactccgacctggtacc 20 99 

II II 1 I I M I I I I I I I II I I I I I II II I I I II M II I 

4 312 T ATCAAGTATTACAGATAGGGATAATAACTGTAAACTCAGACTTGGTACC 4 361 

2100 cgacctgaacccccggatctcccacaccttcaacatcaacgacaacagga 214 9 

Ml I II II II II M I II II I II II II I II I I I I I II I 
4 362 TGACTTAAATCCTAGGATCTCTCATACTTTCAACATAAATGACAATAGAA 4 411 

2150 agtcctgctccctggccctcctgaacaccgacgtgtaccagctgtgctcc 2199 

I I I I II II II M I I I I I I I I I I II II II II Mill II 
4 412 AGTCATGTTCTCTAGCACTCCTAAACACAGATGTATATCAACTGTGTTCG 44 61 

2200 acgcccaaggtggacgagcgctccgactacgccagctccggcatcgagga 224 9 

I I I II I I II II M I I I II II II I I II I I I II II 
4 4 62 ACTCCCAAAGTTGATGAAAGATCAGATTATGCATCATCAGGCATAGAAGA 4 511 

2250 catcgtgctggacatcgtcaaccacgacggctccatctccaccacccgct 2299 

II II II II II II I II M M II II II I II M II I I 
4512 TATTGTACTTGATATtGTCAATCATGATGGTTCAATCTCAACAACAAGAT 4 561 
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2300 tcaagaacaacaacatcagcttcgaccagccctacgccgccctgtacccc 2349 

I I I I I I I I I I I It I I II I I I I I I I I I I I I I I I I I I 

4 562 TTAAGAACAATAATATAAGTTTTGATCAACCATATGCGGCATTATACCCA 4 611 

2350 tccgtgggccccggcatctactacaagggcaagatcatcttcctgggcta 2399 

II II I t II II II I I I II I I I I I I I I II II II II II II 

4 612 TCTGTTGGACCAGGGATATACTACAAAGGCAAAATAATATTTCTCGGGTA 4 661 

24 00 cggcggcctggagcaccccatcaacgagaacgccatctgcaacaccaccg 2449 

II II II II II I I I I II I I I II II II II I I I M II II I 
4 662 TGGAGGTCTTGAACATCCAATAAATGAGAATGCAATCTGCAACACAACTG 4 711 

24 50 ggtgccccggcaagacccagcgggactgcaaccaggcctcccacagcccc 24 99 

I I I I I I I I I II II II I I I If I I II I II II I II I I I I II 
4712 GGTGTCCCGGGAAAACGCAGAGAGACTGCAATCAGGCATCTCATAGTCCT 4761 

• • * . . 
2500 tggttctccgaccgccgcatggtgaactccatcatcgtggtggacaaggg 254 9 

I I I I I M M I I I I I I II I M I I I I I M I I I I I I I I I I I I 

47 62 TGGTTTTCAGACAGAAGGATGGTCAACTCCATTATTGTTGTTGACAAGGG 4 811 

2550 cctgaactccatccccaagctgaaggtgtggaccatctccatgcggcaga 2599 

I I Mill II II I I I I I I I I I I I I I I I I I I I I I M I II I 
4812 CT T AAAC T C AAT T CC AAAAC T G AAGGT AT GG AC GAT AT C CAT GAG AC AAA 4 8 61 

• • • • • 

2 600 actactggggctccgagggccgcctgctgctgctgggcaacaagatctac 2 64 9 
I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I II I I I 

48 62 ATTACTGGGGGTCAGAAGGAAGGCTACTTCTACTAGGTAACAAGATCTAT 4 911 

2 650 atctacacccgctccaccagctggcacagcaagctgcaactgggcatcat 2 699 

I I I I I I I I i I I II I I I I I I I I I M i I f I I I II I I 

4 912 ATATATACAAGATCTACAAGTTGGCATAGCAAGTTAC AATTAGGAATAAT 4 961 

27 00 cgacatcaccgactacagcgacatccgcatcaagtggacctggcacaacg 274 9 

= ! I! II M Mill II M I M M I II I I I II I I II I 
4 962 TGATATTACTGATTACAGTGATATAAGAATAAAATGGACATGGCATAATG 5011 

• • • • • 

27 50 tgctgagccggcccggcaacaacgagtgcccctggggccactcctgcccc 27 99 

MM I II M M II I II I I I I II I I I II II II II I 

5012 TGCTATCAAGACCAGGAAACAATGAATGTCCATGGGGACATTCATGCCCA 5061 

• » * • • 
2800 gacggctgcatcaccggcgtgtacaccgacgcctaccccctgaaccccac 2849 

II II M II II II II II M II M II II II II Mill 
5062 GATGGATGTATAACAGGAGTATATACTGATGCATATCCACTCAATCCCAC 5111 

• • * * • 

2850 cggcagcatcgtgagctccgtgatcctggactcccagaagtcccgggtga 2899 

II II M I M I I I II II I I M II M II I I I I I I 
5112 AGGGAGCATTGTGTCATCTGTCATATTAGACTCGCAAAAATCGAGAGTAA 5161 
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2 900 accccgtgatcacctacagcacctccaccgagcgcgtgaacgagctggcc 294 9 

I I E I M I ! I I I I I II I I I I I I I I I I I I I I I I I I I I I 
5162 ACCCAGTCATAACTTACTCAACATCAACTGAAAGGGTAAACGAGCTGGCC 5211 

2 950 atccgcaacaagaccctgagcgccggctacaccaccaccagctgcatcac 2999 

I II I I I I II I II II I I II I I I I II II I I I I I I I I II 
5212 ATCCGAAACAAAACACTCTCAGCTGGATATACAACAACGAGCTGCATTAC 52 61 

■ ■ • * • 

3000 ccactacaacaagggctactgcttccacatcgtggagatcaaccacaaga 304 9 

I I I I I I I I M II II II II I I II II II II II II II I 
52 62 ACACTATAACAAAGGATATTGTTTTCATATAGTAGAAATAAATCATAAAA 5311 



3050 gcctggacaccttccagcccatgctgttcaagaccgagatccccaagagc 3099 

II I I I I I I I I I I I II III I I I I I I I II Mill II II II I 
5312 GCTTAGACACATTCCAACCTATGTTGTTCAAAACAGAGATTCCAAAAAGC 5361 

3100 tgcagctaaGAAT 3112 

I I I I I I I I II 
5362 TGCAGTTAATCAT 5374 
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Fig. 33B 

(Linear) MAP of: FrsvHNpiv3 . seq check: 7448 from: 1 to: 
3090 nucleic acids sequence of FrsvHNpiv3 (non humanised) 

atggagttgctaatcctcaaaacaaatgcaattaccgcaatccttgctgc 
agtcacactctgttttgcttccagtcaaaacatcactgaagaattttatc 
aatcaacatgcagtgcagtcagcaaaggctatcttagtgctctaagaact 
ggttggtatactagtgttataactatagaattaagtaatatcaaggaaaa 
taagtgtaatggaacagacgctaaggtaaaattgataaaacaagaattag 
ataaatataaaagtgctgtaacagaattgcagttgctcatgcaaagcaca 
ccggcaaccaacaatcgagccagaagagaactaccaaggtttatgaatta 
tacactcaacaataccaaaaataccaatgtaacattaagcaagaaaagga 
aaagaagatttcttggctttttgttaggtgttggatctgcaatcgccagt 
ggcattgctgtatctaaggtcctgcacctagaaggggaagtgaacaaaat 
caaaagtgctctactatccacaaacaaggctgtagtcagcttatcaaatg 
gagttagtgtcttaaccagcaaagtgttagacctcaaaaactatatagat 
aaacagttgttacctattgtgaacaagcaaagctgtagcatatcaaacat 
tgaaactgtgatagagttccaacaaaagaacaacagactactagagatta 
ccagggaatttagtgttaatgcaggtgtaactacacctgtaagcacttat 
atgttaacaaatagtgaattattatcattaatcaatgatatgcctataac 
aaatgatcagaaaaagttaatgtccaacaatgttcaaatagttagacagc 
aaagttactctatcatgtccataataaaggaggaagtcttagcatatgta 
gtacaattaccactatatggtgtaatagatacaccttgttggaaactgca 
cacatcccctctatgtacaaccaacacaaaggaagggtccaacatctgtt 
taacaagaaccgacagaggatggtactgtgacaatgcaggatcagtatct 
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ttcttcccactagctgaaacatgtaaagttcaatcgaatcgagtattttg 
tgacacaatgaacagtttaacattaccaagtgaagtaaatctctgcaaca 
ttgacatattcaaccccaaatatgattgcaaaattatgacttcaaaaaca 
gatgtaagcagctccgttatcacatctctaggagccattgtgtcatgcta 
tggcaaaactaaatgtacagcatccaataaaaatcgtggaatcataaaga 
cattttctaacgggtgtgattatgtatcaaataagggggtggacactgtg 
tctgtaggtaatacattatattatgtaaataagcaagaaggcaaaagtct 
ctatgtaaaaggtgaaccaataataaatttctatgacccattagtgttcc 
cctctgatgaatttgatgcatcaatatctcaagtcaatgagaagattaac 
cagagcctagcatttattcgtaaatccgatgaattattacataatgtaaa 
tgctggtaaatccaccacaaatatcatgAACAATGAGTTTATGGAAGTTA 
CAGAAAAGATCCAAATGGCATCGGATAATATTAATGATCTAATACAGTCA 
GGAGTGAATACAAGGCTTCTTACAATTCAGAGTCATGTCCAGAATTATAT 
ACCaATATCATTGACACAACAAATGTCGGATCTTAGGAAATTCATTAGTG 
AAATTACAATTAGGAATGATAATCAAGAAGTGCCTCCACAAAGAATAACA 
CATGATGTGGGCATAAAACCTTTAAATCCAGATGATTTTTGGAGATGCAC 
GTCTGGTCTTCCATCTTTAATGAAAACTCCAAAAATAAGGTTAATGCCGG 
GGCCGGGATTATTAGCTATGCCAACGACTGTTGATGGCTGTGTTAGAACT 
CCGTCCTTAGTTATAAATGATCTGATTTATGCTTATACCTCaAATCTAAT 
TACTCGAGGTTGCCAGGATATAGGAAAATCATATCAAGTATTACAGATAG 
GGATAATAACTGTAAACTCAGACTTGGTACCTGACTTAAATCCTAGGATC 
TCTCATACTTTCAACATAAATGACAATAGAAAGTCATGTTCTCTAGCACT 
CCTAAAtACAGATGTATATCAACTGTGTTCGACTCCCAAAGTTGATGAAA 
GATCAGATTATGCATCATCAGGCATAGAAGATATTGTACTTGATATtGTC 
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AATC AT GATG GTT C AATC T C AAC AACAAG AT T T AAG AAC AAT AAT AT AAG 
TTTTGATCAACCATATGCGGCATTATACCCATCTGTTGGACCAGGGATAT 
ACTACAAAGGCAAAATAATATTTCTCGGGTATGGAGGTCTTGAACATCCA 
ATAAATGAGAATGCAATCTGCAACACAACTGGGTGTCCCGGGAAAACGCA' 
GAGAGACTGCAATCAGGCATCTCATAGTCCcTGGTTTTCAGACAGAAGGA 
TGGTCAACTCCATTATTGTTGTTGACAAGGGCTTAAACTCAATTCCAAAA 
CTGAAGGTATGGACGATATCCATGAGACAAAATTACTGGGGGTCAGAAGG 
AAGGCT ACT T C TACT AGGT AAC AAGAT C T AT AT AT AT AC AAGAT CT AC AA 
GTTGGCATAGCAAGTTACAATTAGGAATAATTGATATTACTGATTACAGT 
GATATAAGAATAAAATGGACATGGCATAATGTGtTATCAAGACCAGGAAA 
CAATGAATGTCCATGGGGACATTCATGtCCAGATGGATGTATAACAGGAG 
TATATACTGATGCATATCCgCTCAATCCCACAGGGAGCATTGTGTCATCT 
GTCATATTAGACTCGCAAAAATCGAGAGTAAACCCAGTCATAACTTACTC 
AACAtCAACTGAAAGGGTAAACGAGCTGGCCATCCGAAACAAAACACTCT 
CAGCTGGATATACAACAACGAGCTGCATTACACACTATAACAAAGGATAT 
TGTTTTCATATAGTAGAAATAAATCATAAAAGCTTAGACACATTCCAACC 
TATGTTGTTCAAAACAGAGATTCCAAAAAGCTGCAGTTAA 
[SEQ ID NO: 55] 
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Fig. 33C 

(Linear) MAP of: FrhumHNphum. seq check: 9920 from: 1 to: 
3090 Humanised nucleic acids sequence of FRSVHNPiV3 

GGGTGGTACACtAGtGTGATCACCATCGAGCTGAGCAACATCAAGGAGAA 
CAAGTGCAACGGCACCGACGCCAAGGTGAAGCTGATCAAGCAGGAGCTGG 
ACAAGTACAAGAGCGCCGTGACCGAGCTGCAGCTGCTGATGCAGAGCACC 
CCCGCCACCAACAACagaGCCAGGCGCGAGCTGCCCAGGTTCATGAACTA 
CACCCTCAACAACACCAAGAACACCAACGTGACCCTGAGCAAGAAGcGcA 
AGaggCGcTTCCTGGGCTTCCTGCTGGGCGTGGGCTCCGCCATCGCCAGC 
GGCATCGCGGTGTCCAAGGTCCTGCACCTGGAGGGGGAGGTGAACAAGAT 
CAAGAGCGCCCTGCTCTCCACCAACAAGGCGGTGGTCAGCCTGTCCAACG 
GCGTGAGCGTGCTGACCAGCAAGGTGCTGGACCTCAAGAACTACATCGAC 
AAGCAatTGCTCCCCATCGTGAACAAGCAGtcCTGCAGCATCTCTAACAT 
TGAGACCGTGATCGAGTTCCAGCAGAAGAACAACAGGCTGCTGGAGATCA 
CCAGGGAGTTCAGCGTGAACGCgGGcGTcACCACCCCGGTGAGCACCTAC 
ATGCTGACCAACAGCGAGCTGCTGTCCCTGATCAACGACATGCCCATCAC 
CAACGACCAGAAGAAGCTtATGTCCAACAACGTGCAGATCGTGCGCCAGC 
AGAGCTACagCATCATGagCATCATCAAGGAGGAGGTGCTGGCCTACGTG 
GTGCAGCTGCCCCTGTACGGCGTGATCGACACCCCCTGCTGGAAGCTGCA 
CACCTCCCCCCTGTGCACCACCAACACCAAGGAGGGCTCCAACATCTGCC 
TGACCCGCACCGACCGGGGCTGGTACTGCGACAACGCCGGCTCCGTGTCC 
TTCTTCCCCCTGGCGGAGACCTGCAAGGTGCAGTCCAACCGCGTGTTCTG 
CGACACCATGAACAGCCTGACCCTGCCCAGCGAGGTGAACCTCTGCAACA 
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TCGACATCTTCAACCCCAAGTACGACTGCAAGATtATGacctccaagacc 

gacgtgagcagctccgtgatcacctccctgggcgccatcgtgtcctgcta 

cggcaagaccaagtgtacagcctccaacaagaaccgcggcatcatcaaga 

ccttctccaacggctgcgactacgtgtccaacaagggcgtggacaccgtg 

tccgtgggcaacaccctgtact'acgtgaacaagcaggagggcaagagcct 

gtacgtgaagggcgagcccatcatcaacttctacgacccgctggtgttcc 

cctccgacgagttcgacgcctccatctcccaggtgaacgagaagatcaac 

cagagcctggccttcatccgcaagtccgacgagctgctgcacaacgtgaa 

cgccggcaagtccaccaccaacatcatgaacaacgagttcatggaggtga 

ccgagaagatccagatggcctccgacaacatcaacgacctgatccagtcc 

ggcgtgaacacccggctgctgaccatccagagccacgtgcagaactacat 

cccca tctccctgacccagcagatgtccgacctgcggaagttcatcagcg 

agatcaccatccggaacgacaaccaggaggtgcccccccagaggatcacc 

cacgacgtgggcataaagcccctgaaccccgacgacttctggcgctgcac 

ctccggcctcccctccctgatgaagacccccaagataaggctgatgcccg 

ggcccggcctgctggccatgcccaccaccgtggacggctgcgtgcgcacc 

ccctccctggtgatcaacgacctgatctacgcctacacctccaacctgat 

cacccgcggctgccaggacatcggcaagtcctaccaggtgctgcagatcg 

gcatcatcaccgtgaactccgacctggtacccgacctgaacccccggatc 

tcccacaccttcaacatcaacgacaacaggaagtcctgctccctggccct 

cctgaacaccgacgtgtaccagctgtgctccacgcccaaggtggacgagc 

gctccgactacgccagctccggcatcgaggacatcgtgctggacatcgtc 

aaccacgacggctccatctccaccacccgcttcaagaacaacaacatcag 

cttcgaccagccctacgccgccctgtacccctccgtgggccccggcatct 
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actacaagggcaagatcatcttcctgggctacggcggcctggagcacccc 
atcaacgagaacgccatctgcaacaccaccgggtgccccggcaagaccca 
gcgggactgcaaccaggcctcccacagcccctggttctccgaccgccgca 
tggtgaactccatcatcgtggtggacaagggcctgaactccatccccaag 
ctgaaggtgtggaccatctccatgcggcagaactactggggctccgaggg 
ccgcctgctgctgctgggcaacaagatctacatctacacccgctccacca 
gctggcacagcaagctgcagctgggcatcatcgacatcaccgactacagc 
gacatccgcatcaagtggacctggcacaacgtgctgagccggcccggcaa 
caacgagtgcccctggggccactcctgccccgacggctgcatcaccggcg 
tgtacaccgacgcctaccccctgaaccccaccggcagcatcgtgagctcc 
gtgatcctggactcccagaagtcccgggtgaaccccgtgatcacctacag 
cacctccaccgagcgcgtgaacgagctggccatccgcaacaagaccctga 
gcgccggctacaccaccaccagctgcatcacccactacaacaagggctac 
tgcttccacatcgtggagatcaaccacaagagcctggacaccttccagcc 
catgctgttcaagaccgagatccccaagagctgcagctaa 

[SEQ ID NO : 56] 
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Fig. 34A 
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Fig. 34B: Humanization impact on the level of expression of F RSV HN 
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Fig 35: Codon usage of F MuV H Mv and highly expressed human genes (hum high exp) 
The frequencies (xlOO) of the individual codons are shown for each of the 
degenerately encoded amino acids, and the most prevalent codon is shown in bold. 
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Fig . 36: Schematic diagram of the PCR synthesis of each fragment in which X and 
Y are restriction sites that allow retrieval of the full size fragment from the cloning 
vector. 
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Fig 37 : Sequence of the 12 oligonucleotides from which PCR fragment A was 
generated. 



l)oli 1 FmuvHmv 1-98, horn ARN 

ggtctagaccaccATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAG 

+ + + + + + 60 

CAGCATCTGCGTGAACATCAACATCCTGCAGCAGATCG 
+ + + 98 



2) oii 2 FmuvHmv 82-181, inv comp ARN 

GTTGGGCAGCAGCTTGACCACCACGTAGGAGCTGGAGCTCTGGGAGTAGTAGCTCAGCTG 

+ + + + + + 60 

CCTCACCTGCTGCTTGATGTATCCGATCTGCTGCAGGATG 
+ + + + 100 



3) oli 3 FmuvHmv, 166-264 horn ARN 

CAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAAGAGCGTGACCCA 

+ + + + + + 60 

GTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCGC 
+ + + 99 



4) oli 4 FmuvHmv, 250-352, inv comp ARN 

CAGGGCGGCGATGCCGATGGCGATGCCGGCGAACCGCTTGTGCCGCCGGGAGCCGGGGGA 

+ + + + + + 60 

GGGGGAGGTGATGTTGTTGATGTTCTCGGCGATGGGCAGCAGC 
+ + + +— 103 



5) oli 5 FmuvHmv, 338-441, horn ARN 

GGCATCGCCGCCCTGGGCGTGGCCACCGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTG 

+ + + + + + 60 

CAGGCCCAGACCAACGCCCGCGCCATCGCCGCCATGAAGAACTC 
+ + + + 104 



6) oli 6 FMuvHmv, 427-523, inv comp ARN 

GTCCTGGATGGCCTGCACGGCGATGGCCAGCTGCTGGGTGCCCTCCTTCACCTCGAACAC 

+ + + + + + 60 

GGCGCGGTTGGTGGCCTGGATGGAGTTCTTCATGGCG 
+ + + 97 



7) oli 7 FMUVHM 509-610, horn ARN 

CAGGCCATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGTCCTGC 

+ + + + + + 60 

CAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTAC 
+ + + +— 102 
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8) oli 8 FMUHM, 595-691, inv comp ARN 

GGACCGCAGGGCCTGGATACTGATGGGggaCAGGGCGGGGTTGATCAGCTGGGGCTGGAA 

+ + + + + + 60 

CACGGTGGTCAGCTCGGTCAGGTACAGGCCCAGGGAG 
+ + + 97 



9) oli 9, 677-778, horn ARN 

CAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCGCCGTGGTGCAGGCCACCCTGAGC 

+ + + + + + 60 

ACCTCCATCAGCGCCGCCGAGATCCTGAGCGCCGGCCTGATG 
+ + + 102 



10) oli 10, FmuvHmv, 763-862, inv comp ARN 

GTTGGACTGGGTCACGATGGTGGGCACGTTGATCTTCACGATCATCTGCATCTCGTCCAG 

+ + + + + + 60 

CAGCACGGACACGATCTGGCCCTCCATCAGGCCGGCGCTC 
+ + + + 100 



11) oli 11, FmuvHmv, 848-949, homARN 

GTGACCCAGTCCAACGCCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAAC 

+ + + + + + 60 

CAGGAGTCCATCATCCAGCTGCCCGACCGCATCCTGGAGATC 
x + + +— 102 



12) oli 12 FMUHM, 935-1039, inv comp ARN 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

+ + + + + + 60 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 
+ + + + 105 



[SEQ ID NOS: 57-68 respectively] 
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Fig. 38 : Sequence of the 9 oligonucleotides from which PCR fragment B was 
generated. 

12) oli 12 FMUHM, 935-1039, inv compARN 

GCTCAGCCGCTCGGCCTCGTTGTACTGGCAGAAGATGTGGTGGCGGGTCAGCTTGCAGTT 

+ + + + + + 60 

CTTGGCGGGGTAGCGCCACTGCTCGTTGCCGATCTCCAGGATGCG 
+ + — : + + 105 



13) oli 13, 1025-1129, horn ARN 

GCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCCGGCAACATCAGCGCCTGCGTG 

+ + + + + + 60 

TTCTCCAGCATCGCCGGCAGCTACATGCGCCGCTTCGTGGCCCTG 
+ + + + 105 



14) oli 14, 1115-1216, inv comp ARN 

GGCGTGGTGGTCGGGCTGGTAGATGGGGTAGGAGGGGCTCTTGCACAGGCAGGTCAGGCT 

+ + + + + + 60 

GCGGCAGTTGGCCACGATGGTGCCGTCCAGGGCCACGAAGCG 
+ + + +-- 102 



15) oli 15, 1202-1299, horn ARN 

CCCGACCACCACGCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGAC 

+ + — + + + + 60 

GGCCTGGACTTCAGCATCGTGTCCCTGAGCAACATCAC 
+ + + 98 



16) oli 16 1285-1387, inv comp ARN 

CTTGCTCAGCTCGGTGGAGATGTCGATGGGCTGGGTGTTGATGGTCTGGCTCAGGCTGAT 

+ + + + + + 60 

GGTCAGGTTCTCGGCGTAGGTGATGTTGCTCAGG 
+ + + 94 



17) oii 17, 1363-1462, horn ARN 

CACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACATCAAGGAGAG 

+ + + + + + 60 

CAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGC 
+ + + + 100 



18) oli 18, 1447-1550, inv comp ARN 

TCACCTGGTGCTCGATGGAGTTGGTCACGTCCAGGTTGGTGCTCAGGCTCTTGTGGATCT 

+ + + + + + 60 

CGGCGGTGTAGATGGCGGCGCGGTGCAGGCGCTTGCTGCTCACG 
+ + + + 104 
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19) oli 19, 1534-1636, horn ARN 

CATCGAGCACCAGGTGAAGGACGTGCTGACCCCCCTGTTCAAGATCATCGGCGACGAGGT 

+ + + + + + 60 

GGGCCTGCGCACCCCCCAGCGCTTCACCGACCTGGTGAAGTTC 
+ + + + 103 



20) oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 

+ + + + + + 60 

GGAACTTGATCTTGTCGGAGATGAACTTCACCAGGTC 
+ + + 97 



[SEQ ID NOS: 69-77 respectively] 
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Fig. 39 : Sequence of the 11 oligonucleotides from which PCR fragment C was 
generated. 

20) oli 20 FmuvHmv, 1622-1718, inv comp ARN 

GCTCGGGGGGGTTGATGCACCAGGTCAGGTCGCGGAAGTCGTACTCGCGGTCGGGGTTCA 
+ + + + + + 60 

GG AAC T T GAT C T T GT C GGAGAT G AAC TT C AC C AG GT C 
+ + + 97 

21) oli 21, FmuvHmv, 1701-1799, horn ARN 

GCATCAACCCCCCCGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCG 

+ + + + + + 60 

CCGAGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGC 
+ + + 99 

22) oli 22,1784-1888, inv comp 

CATGTTGCTGAACTGGCCCCGGATGGTGGTGGGGCCGCTGCAGTTGCCCTTGCTCACGGC 

+ + + + + + 60 

CAGGAACTGGTTGGTGGTGCGGGTCTCCAGCAGGGTGCTGTTCAC 
+ + + + 105 

23) oli 23, 1874-1971, horn ARN 

CAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACCTGGGCCGGGGCTACAACGTG 

+ + + + + + 60 

AGCAGCATCGTGACCATGACCAGCCAGGGCATGTACGG 
+ + + 98 

24) oli 24, 1957-2057, inv. comp ARN 

CCACCTCGAACACGCGGTACATGCTCAGCTGGCTCAGCTCGCTCCGCTTGCTGCTCAGGT 

+ + + + + + 60 

TGGGCTTCTCCACCAGGTAGGTGCCGCCGTACATGCCCTGG 
+ + + +- 101 

25) oli 25, FmuvHmv, 2043-2140, homARN 

GCGTGTTCGAGGTGGGCGTGATCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGA 
+ + + 4- 4- + 60 

C C AAC T AC C T G GAG C AGC C C GT G AGC AACG AC C T G AGC 
+ + + 98 

26) oli26, FmuvHmv, 2125-2227, inv compARN 

GCCGCTGCCCTGGTAGGGGATGGTGATGCTGTCCTCGCCGTGGCACAGGGCGGCCAGCTT 

+ 4. 4- + + + 60 

CAGCTCGCCCAGGGCCACCATGCAGTTGCTCAGGTCGTTGCTC 
4- + + +— 103 

27) oli 27, 2212-2309, FmuvHm, horn ARN 

CTACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTGTGGAAGAG 

+ + 4- 4- + + 60 

CCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCG 
+ + + 98 
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28) oli 28,FmuvHmv, 2294-2392, inv comp ARN 

GGTGGGCACGGCCCACTTGGCCTGGTTGTCGGCGATCACGCCGCGGTGGCTGCTCAGGTA 

+ + + +- + + 60 

CAGGCGGTCGATCACGGGGTCGTCGGTGCTCAGGGGCAC 
+ + + 99 

29) oli 29, Fmuv Hmv, 2377-2477, horn ARN 

GTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTGCGCATGGAGACCTGCTTCCAGCA 

+ + + + + + 60 

GGCCTGCAAGGGCAAGATCCAGGCCCTGTGCGAGAACCCCG 
+ + + + - 101 

30) oli 30, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 

+ + + + + + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
+ + + + 100 



[SEQ ID NOS: 78-88 respecively] 
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Fig. 40 : Sequence of the 8 oligonucleotides from which PCR fragment D was 
generated. 

30) oli 30, FmuvHmv 2462-2561, inv comp 

TGATCTTCAGCTCCACGGTCAGGCTCAGGTCCACGCTCAGCACGCCGTAGCTGGGGATGC 

+ + + + + + 60 

GGTTGTCCTTCAGGGGGGCCCAtTCGGGGTTCTCGCACAG 
+ 4- + + 100 



31) oli31, FmuvHmv, 2546-2649, hom ARN 

GTGGAGCTGAAGATCAAGATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGC 

+ + + + + + 60 

ATGGACCTGTACAAGAGCAACCACAACAACGTGTACTGGCTGAC 
+ + + + 104 



32) oli 32, FmuvHmv, 2635-2738, inv comp. ARN 

CGGTGAACAGGTAGGGGCTCACCTTGAAGCGGGGaATCCACTCCAGGGTGTTGATCACGC 

+ + + + + + 60 

CCAGGGCCAGGTTCTTCATGGGGGGGATGGTCAGCCAGTACACG 
+ + + + 104 



33) oli 33, FmuvHmv, 2723-2827, hom ARN 

CCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGACTGCCACGCCCCGACCTAC 

+ + + + + + 60 

CTGCCCGCCGAGGTGGACGGCGACGTGAAGCTGAGCAGCAACCTG 
+ + + + 105 



34) oli 34, FMUVHmv, 2813-2911, inv comp ARN 

CACGTAGTACACCACGGCGTGCTCCACGCGGCTGGTGTCGTAGGTGGCCAGCACGTACTG 

. + + + — : + + + 60 

CAGGTCCTGGCCGGGCAGGATCACCAGGTTGCTGCTCAG 
+ + + 99 



35) oli 35 FMUHM, 2897-2995, homARN 

GTGGTGTACTACGTGTACAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTG 

+ + + + + + 60 

CCCATCAAGGGCGTGCCCATCGAGCTGCAGGTGGAGTGC 
+ + + 99 



36) oli 36, FmuvHmv, 2981-3078, inv comp ARN 

CCGCTGTGGGTGATGTGGCCGCCGCTCTCGCTGTCGGCCAGCACGCAGAAGTGGCGGCAC 

+ + + + + + 60 

CACAGCTTCTGGTCCCAGGTGAAGCACTCCACCTGCAG 
+ + + 98 
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37) oli 37, 3064-3147, homARN 

CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCCGCGAGGACGG 
_; + + + + + + 60 

CACCAACCGCCGCTAGcgaattcc 
+ + 84 



[SEQ ID NOS: 89-96 respectively; 
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Fig. 41 : Construction of pEE14F MuV hum HN M vhum 
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Figure 42 A : Humanised nucleic acids sequence of F MuV H MV (upper 
sequence) compared to the original F MuV H MV sequence (lower sequence) and 
the corresponding amino acids sequence. 



14 ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 63 
I I I I I I I I II II II II III I I I I I I I 11 I I I I I II I 
1 ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 50 

64 CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 113 

II II I I II I Mill III I I I I I I I ! I I I I I I! I I I I I I I I I 
51 T AT ATG T GT G AAT AT C AAT AT C TT GC AGCAAAT T GG AT AC AT CAAGC AAC 100 

114 AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 163 

I I I I I I I I I II I I I I I I I I I I II II II I I I I I I I I I I I III 
101 AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 150 

164 GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 213 

MINIM 1 II II Mill I I I I I M I I II I I I I II II M 
151 GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 200 

214 GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 263 

E I I I I II II I I I I I II I I II I I I I II I II I M M M I 
201 GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 250 

264 CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 313 

I II I II I I I I I I I II II M Mill M II II I I I II 

251 CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 300 

314 AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 363 

II I I II I II I I II I Mill Mill II II I I I II II It II 

301 AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 350 

364 CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 413 

III II II Mill Mill II II I M II II Mill M f 

351 CGCAGCACAAGTGACTGCCGCTGTCTCATTAGTTCAAGCACAGACAAATG 4 00 

414 CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 4 63 

I II II II II II I II II II M II MEM II M II M 

401 CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 4 50 

4 64 GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 513 

II I I II I I II M I M I I I M I II III I II II M II II II 

451 GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 500 

514 CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 563 

II I IE I Mill I II I I M II I II I II II I I I I II I I II I II I 
501 cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 550 



WO 00/18929 



PCI7EP99/07004 



61/73 



564 CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 613 

I M I I I I I I I I II I I I I I I I I I I I I I I I I I I II I Mill 
551 CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 600 

614 ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtC 663 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 650 

664 cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 713 

II If I I I I I II III II I I I I I I I I II II I I I I I II I 
651 ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 700 

714 CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 763 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

701 CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 750 

7 64 AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 813 

II I I I I I II I I I I I I II I I I I I II II II I I I I I II I I I I I 

751 AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 800 

814 GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 863 
I I I I I I I I I II I I I I I I I I I I II I I I I I II II M II II I 

8 01 GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 850 

8 64 CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 913 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

8 51 CAT T GG T G AT TGAC T T C T AC T C AAT T T C G AGT T T TAT TAATAAT CAAGAA 900 

914 T CC AT C AT C C AGCT GC C CGAC C GC AT CCTGGAG AT CGGCAACG AG CAGTG 963 

I I I I I II II I I I I III I III I I I I I I I I I I II II II II 
901 TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 950 

964 GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 1013 

I II I! I II II I I I I I II III I I I I I I I I I I If I I I I I I I I 
951 GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 1000 

1014 AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 1063 

I I I I I I I I I I I III I I I I I I I I I I I I II II II I I I I I II 
1001 AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 1050 

1064 GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 1113 

Mill II II I I I I I M I II I II M II II II I I III I 
1051 GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 1100 

■ • • • • 

1114 CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 1163 

II II II II II II I II II II II II I I II I II II II I 
1101 GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 1150 

1164 CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 1213 

I II II I II I I I II li II II II II I II I I II I I II I II 
1151 CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 1200 
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1214 GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 1263 

I I I I I I I I I I I I I II I I I I I I I I I I I f MINIMI 
1201 GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 1250 

1264 CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 1313 

Mill II I I M M M II M II I I II II II I Mill Mill I 
1251 ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 1300 

1314 TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 1363 

Mill II I M M I II II It II M II I II II II 

1301 TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 1350 

13 64 ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 1413 

M I I II I M I I II II II II Mill II II II II I It II II I 
1351 ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 14 00 

1414 CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 14 63 

! I I M I! I II II M M II I II II II I I II II 

1401 AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 1450 

14 64 TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 1513 

I I I M II M II M II I II II I II I I I II I II Mill II I I I I 
14 51 TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 1500 

1514 AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 1563 

M II II II II II II I II II II I I Mill I! II II II I! II II 
1501 AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 1550 

1564 CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 1613 

II M I II I I II II I II I II II Ml I II I I I I II II III 

1551 ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 1600 

1614 GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 1663 

I Mill EMM I I I II M II I I II I II M I M II II I I I II 
1601 GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 1650 

1664 CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 1713 

M II I II II II II II I I I II II II I II II I I II II I I II 
1651 CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 1700 

1714 CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 1763 

III I II I II I I II II M II IMM II II Mill II I 
1701 AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 17 50 

1764 AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 1813 

I I II I I Mill II I II It I II IE II II I II I I I I I II 
17 51 AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 1800 

1814 ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 1863 

I II II II II I I I I M M I I I II I I I I II II II I I I M 
1801 ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 1850 
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1864 CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 1913 

III I II II III I I I I I I II I I I I I I I I III I I I I 
1851 AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 1900 

1914 TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 1963 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

1901 TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 1950 

1964 ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 2013 

Mill II II II I I I I I Mill I II I I II M II I I I I I II I 
1951 ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 2000 

2014 GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 2063 

I III II II I I I I M I I I I I I I I Mill II II M II I 
2001 GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 2050 

2064 TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 2113 

II I II II II I I I I II II I II II I I I Mill II II I EE 

2051 TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 2100 

2114 GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTGGCCCTGGGCGA 2163 

I I I I I II II II II M II I I I II I II II II II M MM II 
2101 GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 2150 

2164 GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 2213 

III M II II Mill M I II II II M Mill M II II 
2151 GCTCAAACTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 2200 

2214 ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 2263 

I I I I II II II II II II I II II II I I II MM! II M 
2201 ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 2250 

2264 TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 2313 

I I II I II I I I II M II II I II II I I II I I M II I I 
2251 TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 2300 

2 314 CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 2363 

M I I II I II I I II I M II If I I II M II II I I 

2301 TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 2350 

2364 ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 2413 

II I II II II II II I II II II II II II II II II I I II II 
2351 ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 2400 

2 414 CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 24 63 

II II 11 I II I II II II I I II II I II II I I I M I I II I II II 
2401 CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 2450 

2464 GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 2513 

II I II I I I II I I I I II I I II I I I II I I I I I I I I M I 
2451 CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 2500 
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2514 ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 2563 

I I I I I I II II I I I I I I I I I I M I ! I I | | | | | | | II | 
2501 ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 2550 

2564 ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 2613 

III! I I I I I I I I I I I I! I II I I I I I I I I I I I I I I I 

2 551 ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 2 600 

2 614 GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 2 663 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 11 II I I I I 
2 601 ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 2 650 

2 664 AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 2713 

M I I I I I III I II II I I I I I I I I I I I I I I II I I II I III 
2 651 AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 2700 



2714 AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGGGAGGA 2763 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II II I I I IE 
2701 AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 2750 

27 64 CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 2813 
I I I I M I I I I I II I I I ! I II II I I I I I I I I II II II II I 

27 51 CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 2800 

2814 TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 2863 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
2801 TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 2850 

28 64 GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 2913 

II I I I I I I I I II I I I I II II II Mill II I I I II II 
2851 GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 2900 

2914 CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 2963 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
2901 CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 2950 

2964 AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 3013 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II 

2951 AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 3000 

3014 CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 3063 

II I I I I I I I I I I II I 1 I I I I I I I II III II II II II 
3001 CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 3050 

3064 CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 3113 

I I I I I III II I I I I I I I I I I I I I I II I I I I I I II II I I I I 
3051 TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 3100 



3114 GCGAGGACGGCACCAACCGCCGCTAG 3139 

I II II II I I I I I III I III 
3101 GGGAAGATGGAACCAATCGCAGATAG 3126 
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Fig. 42B : F MU H MV .seq check: 4381 from: 1 to: 3126 
nucleic acid sequence of F MU H MV (non humanised) 

ATGAAGGCTTTTCCAGTTATTTGCTTGGGCTTTGCAATCTTTTCATCCTC 
TATATGTGTGAATATCAATATCTTGCAGCAAATTGGATACATCAAGCAAC 
AGGTCAGGCAACTAAGCTATTACTCACAAAGTTCAAGCTCCTACGTAGTG 
GTCAAGCTTTTACCGAATATCCAACCCACTGATAACAGCTGTGAATTTAA 
GAGTGTAACTCAATACAATAAGACCTTGAGTAATTTGCTTCTTCCAATTG 
CAGAAAACATAAACAATATTACGTCGCCCTCACCTGGGTCAAGACGTCAT 
AAACGGTTTGCTGGCATTGCCATTGGCATTGCgGCcCTCGGTGTTGCGAC 
C G C AG C AC AAG T G AC TGCCGCTGTCT CAT T AG TT C AA G C AC AG AC AAAT G 
CACGTGCAATAGCAGCGATGAAAAATTCAATACAGGCAACTAATCGGGCA 
GTCTTCGAAGTGAAGGAAGGCACCCAACAGTTAGCTATAGCGGTACAAGC 
cATcCAAGACCATATCAATACTATTATGAACACCCAATTGAACAATATGT 
CTTGTCAGATCCTTGATAACCAGCTTGCAACCTCCCTAGGATTATACCTA 
ACAGAATTAACAACAGTGTTTCAGCCACAATTAATTAATCCAGCATTGTC 
ACCGATTAGTATACAAGCCTTGAGGTCTTTGCTTGGAAGTATGACACCTG 
CAGTGGTTCAAGCAACATTATCTACTTCAATTTCTGCTGCTGAAATACTA 
AGTGCCGGTCTAATGGAGGGTCAGATAGTTTCTGTTCTGCTAGATGAGAT 
GCAGATGATAGTTAAGATAAACGTTCCAACCATTGTCACACAATCAAATG 
CATTGGTGATTGACTTCTACTCAATTTCGAGTTTTATTAATAATCAAGAA 
TCCATAATTCAATTGCCAGACAGGATCTTGGAGATCGGAAATGAACAATG 
GCGCTATCCAGCTAAGAATTGTAAGTTGACAAGACACCACATATTCTGCC 
AATACAATGAGGCAGAGAGGCTGAGCCTAGAAACAAAACTATGCCTTGCA 
GGCAATATTAGTGCCTGTGTGTTCTCATCTATAGCAGGGAGTTATATGAG 
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GCGATTTGTAGCACTGGATGGAACAATTGTTGCAAACTGTCGAAGTCTAA 

CGTGTCTATGCAAGAGTCCATCTTATCCTATATACCAACCTGACCATCAT 

GCAGTCACGACCATTGATCTAACGTCATGTCAAACATTGTCCCTGGACGG 

ACTGGATTTCAGCATTGTCTCGCTAAGCAACATCACTTACGCTGAGAATC 

TTACTATTTCATTGTCTCAGACAATCAATACTCAACCCATTGATATATCA 

ACTGAGCTGAGTAAGGTTAATGCATCCCTCCAAAATGCCGTTAAATACAT 

AAAAGAGAGTAACCATCAACTCCAATCCGTTAGTGTAAGTTCTAAAAGAC 

TTCATCGGGCAGCCATCTACACCGCAGAGATCCATAAAAGCCTCAGCACC 

AATCTAGATGTAACTAACTCAATCGAGCATCAGGTCAAGGACGTGCTGAC 

ACCACTCTTCAAAATCATCGGTGATGAAGTGGGCCTGAGGACACCTCAGA 

GATTCACTGACCTAGTGAAATTCATCTCTGACAAGATTAAATTCCTTAAT 

CCGGATAGGGAGTACGACTTCAGAGATCTCACTTGGTGTATCAACCCGCC 

AGAGAGAATCAAATTGGATTATGATCAATACTGTGCAGATGTGGCTGCTG 

AAGAGCTCATGAATGCATTGGTGAACTCAACTCTACTGGAGACCAGAACA 

ACCAATCAGTTCCTAGCTGTCTCAAAGGGAAACTGCTCAGGGCCCACTAC 

AATCAGAGGTCAATTCTCAAACATGTCGCTGTCCCTGTTAGACTTGTATT 

TAGGTCGAGGTTACAATGTGTCATCTATAGTCACTATGACATCCCAGGGA 

ATGTATGGGGGAACTTACCTAGTGGAAAAGCCTAATCTGAGCAGCAAAAG 

GTCAGAGTTGTCACAACTGAGCATGTACCGAGTGTTTGAAGTAGGTGTTA 

TCAGAAATCCGGGTTTGGGGGCTCCGGTGTTCCATATGACAAACTATCTT 

GAGCAACCAGTCAGTAATGATCTCAGCAACTGTATGGTGGCTTTGGGGGA 

GCTC7WVCTCGCAGCCCTTTGTCACGGGGAAGATTCTATCACAATTCCCT 

ATCAGGGATCAGGGAAAGGTGTCAGCTTCCAGCTCGTCAAGCTAGGTGTC 

TGGAAATCCCCAACCGACATGCAATCCTGGGTCCCCTTATCAACGGATGA 
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TCCAGTGATAGACAGGCTTTACCTCTCATCTCACAGAGGTGTTATCGCTG 
ACAAcCAAGCAAAATGGGCTGTCCCGACAACACGAACAGATGACAAGTTG 
CGAATGGAGACATGCTTCCAACAGGCGTGTAAGGGTAAAATCCAAGCACT 
CTGCGAGAATCCCGAGTGGGCACCATTGAAGGATAACAGGATTCCTTCAT 
ACGGGGTCTTGTCTGTTGATCTGAGTCTGACAGTTGAGCTTAAAATCAAA 
ATTGCTTCGGGATTCGGGCCATTGATCACACACGGTTCAGGGATGGACCT 
ATACAAATCCAACCACAACAATGTGTATTGGCTGACTATCCCGCCAATGA 
AGAACCTAGCCTTAGGTGTAATCAACACATTGGAGTGGATACCGAGATTC 
AAGGTTAGTCCCTACCTCTTCAcTGTCCCAATTAAGGAAGCAGGCGAAGA 
CTGCCATGCCCCAACATACCTACCTGCGGAGGTGGATGGTGATGTCAAAC 
TCAGTTCCAATCTGGTGATTCTACCTGGTCAAGATCTCCAATATGTTTTG 
GCAACCTACGATACTTCCAGGGTTGAACATGCTGTGGTTTATTACGTTTA 
CAGCCCAgGCCGCTCATTTTtTTACTTTTATCCTTTTAGGTTGCCTATAA 
AGGGGGTCCCCATCGAATTACAAGTGGAATGCTTCACATGGGACCAAAAA 
CTCTGGTGCCGTCACTTCTGTGTGCTTGCGGACTCAGAATCTGGTGGACA 
TATCACTCACTCTGGGATGGtGGGCATGGGAGTCAGCTGCACAGTCACCC 
GGGAAGATGGAACCAATCGCAGATAG 
[SEQ ID NO: 97] 
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Fig 42C: F MUV humH M hum.seq check: 5778 from: 14 to: 3139 
Humanised nucleic acids sequence of F MuV H Mv 

ATGAAGGCGTTCCCCGTGATCTGCCTGGGCTTCGCCATCTTCTCCAGCAG 
CATCTGCGTGAACATCAACATCCTGCAGCAGATCGGATACATCAAGCAGC 
AGGTGAGGCAGCTGAGCTACTACTCCCAGAGCTCCAGCTCCTACGTGGTG 
GTCAAGCTGCTGCCCAACATCCAGCCCACCGACAACAGCTGCGAGTTCAA 
GAGCGTGACCCAGTACAACAAGACCCTGAGCAACCTGCTGCTGCCCATCG 
CCGAGAACATCAACAACATCACCTCCCCCTCCCCCGGCTCCCGGCGGCAC 
AAGCGGTTCGCCGGCATCGCCATCGGCATCGCCGCCCTGGGCGTGGCCAC 
CGCCGCCCAGGTGACCGCCGCCGTGTCCCTGGTGCAGGCCCAGACCAACG 
CCCGCGCCATCGCCGCCATGAAGAACTCCATCCAGGCCACCAACCGCGCC 
GTGTTCGAGGTGAAGGAGGGCACCCAGCAGCTGGCCATCGCCGTGCAGGC 
CATCCAGGACCACATCAACACCATCATGAACACCCAGCTGAACAACATGT 
CCTGCCAGATCCTGGACAACCAGCTGGCCACCTCCCTGGGCCTGTACCTG 
ACCGAGCTGACCACCGTGTTCCAGCCCCAGCTGATCAACCCCGCCCTGtc 
cCCCATCAGTATCCAGGCCCTGCGGTCCCTGCTGGGCAGCATGACCCCCG 
CCGTGGTGCAGGCCACCCTGAGCACCTCCATCAGCGCCGCCGAGATCCTG 
AGCGCCGGCCTGATGGAGGGCCAGATCGTGTCCGTGCTGCTGGACGAGAT 
GCAGATGATCGTGAAGATCAACGTGCCCACCATCGTGACCCAGTCCAACG 
CCCTGGTGATCGACTTCTACAGCATCAGCAGCTTCATCAACAACCAGGAG 
TCCATCATCCAGCTGCCCGACCGCATCCTGGAGATCGGCAACGAGCAGTG 
GCGCTACCCCGCCAAGAACTGCAAGCTGACCCGCCACCACATCTTCTGCC 
AGTACAACGAGGCCGAGCGGCTGAGCCTGGAGACCAAGCTGTGCCTGGCC 
GGCAACATCAGCGCCTGCGTGTTCTCCAGCATCGCCGGCAGCTACATGCG 
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CCGCTTCGTGGCCCTGGACGGCACCATCGTGGCCAACTGCCGCAGCCTGA 
CCTGCCTGTGCAAGAGCCCCTCCTACCCCATCTACCAGCCCGACCACCAC 
GCCGTGACCACCATCGACCTGACCTCCTGCCAGACCCTGAGCCTGGACGG 
CCTGGACTTCAGCATCGTGTCCCTGAGCAACATCACCTACGCCGAGAACC 
TGACCATCAGCCTGAGCCAGACCATCAACACCCAGCCCATCGACATCTCC 
ACCGAGCTGAGCAAGGTGAACGCCTCCCTGCAGAACGCCGTGAAGTACAT 
CAAGGAGAGCAACCACCAGCTGCAGAGCGTGAGCGTGAGCAGCAAGCGCC 
TGCACCGCGCCGCCATCTACACCGCCGAGATCCACAAGAGCCTGAGCACC 
AACCTGGACGTGACCAACTCCATCGAGCACCAGGTGAAGGACGTGCTGAC 
CCCCCTGTTCAAGATCATCGGCGACGAGGTGGGCCTGCGCACCCCCCAGC 
GCTTCACCGACCTGGTGAAGTTCATCTCCGACAAGATCAAGTTCCTGAAC 
CCCGACCGCGAGTACGACTTCCGCGACCTGACCTGGTGCATCAACCCCCC 
CGAGCGGATCAAGCTGGACTACGACCAGTACTGCGCCGACGTGGCCGCCG 
AGGAGCTGATGAACGCCCTGGTGAACAGCACCCTGCTGGAGACCCGCACC 
ACCAACCAGTTCCTGGCCGTGAGCAAGGGCAACTGCAGCGGCCCCACCAC 
CATCCGGGGCCAGTTCAGCAACATGAGCCTGTCCCTGCTGGACCTGTACC 
TGGGCCGGGGCTACAACGTGAGCAGCATCGTGACCATGACCAGCCAGGGC 
ATGTACGGCGGCACCTACCTGGTGGAGAAGCCCAACCTGAGCAGCAAGCG 
GAGCGAGCTGAGCCAGCTGAGCATGTACCGCGTGTTCGAGGTGGGCGTGA 
TCCGGAACCCCGGCCTGGGCGCCCCCGTGTTCCACATGACCAACTACCTG 
GAGCAGCCCGTGAGCAACGACCTGAGCAACTGCATGGTCGCCCTGGGCGA 
GCTGAAGCTGGCCGCCCTGTGCCACGGCGAGGACAGCATCACCATCCCCT 
ACCAGGGCAGCGGCAAGGGCGTGAGCTTCCAGCTGGTGAAGCTGGGCGTG 
TGGAAGAGCCCCACCGACATGCAGAGCTGGGTGCCCCTGAGCACCGACGA 
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CCCCGTGATCGACCGCCTGTACCTGAGCAGCCACCGCGGCGTGATCGCCG 
ACAACCAGGCCAAGTGGGCCGTGCCCACCACCCGCACCGACGACAAGCTG 
CGCATGGAGACCTGCTTCCAGCAGGCCTGCAAGGGCAAGATCCAGGCCCT 
GTGCGAGAACCCCGAaTGGGCCCCCCTGAAGGACAACCGCATCCCCAGCT 
ACGGCGTGCTGAGCGTGGACCTGAGCCTGACCGTGGAGCTGAAGATCAAG 
ATCGCGAGCGGCTTCGGCCCCCTGATCACCCACGGCAGCGGCATGGACCT 
GTACAAGAGCAACCACAACAACGTGTACTGGCTGACCATCCCCCCCATGA 
AGAACCTGGCCCTGGGCGTGATCAACACCCTGGAGTGGATtCCCCGCTTC 
AAGGTGAGCCCCTACCTGTTCACCGTGCCCATCAAGGAGGCCGGCGAGGA 
CTGCCACGCCCCGACCTACCTGCCCGCCGAGGTGGACGGCGACGTGAAGC 
TGAGCAGCAACCTGGTGATCCTGCCCGGCCAGGACCTGCAGTACGTGCTG 
GCCACCTACGACACCAGCCGCGTGGAGCACGCCGTGGTGTACTACGTGTA 
CAGCCCCGGCCGCAGCTTCTTCTACTTCTACCCCTTCCGCCTGCCCATCA 
AGGGCGTGCCCATCGAGCTGCAGGTGGAGTGCTTCACCTGGGACCAGAAG 
CTGTGGTGCCGCCACTTCTGCGTGCTGGCCGACAGCGAGAGCGGCGGCCA 
CATCACCCACAGCGGCATGGTGGGCATGGGCGTGAGCTGCACCGTGACCC 
GCGAGGACGGCACCAACCGCCGCTAG 
[SEQ ID NO: 98] 
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